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Alzheimer*s Disease Secretase 

EIEU) OF THE INVENTION 

Tlic present invention related to the Held of Alzheimer's Disease, APP, amyloid beta 

5 peptide, and human aspaityl proteases as well as a method for the identification of agents that 
modulate die activity of these polypeptides. 
BACKGROUND OF THE INVENTION 

Alzheimer's disease (AD) causes progressive dementia with consequent formation 
of amyloid plaques, neurofibrillary tangles, gliosis and neuronal loss. The disease occurs in 

10 both genetic and sporadic forms whose clinical course and pathological features are quite 
similar. Three genes have been discovered to date which when mutated cause an autosomal 
dominant form of Alzheimer's disease. These encode the amyloid protein precursor (APP) 
and two related proteins, prcscnilin-l (PS 1) and prcsenilin-2 (PS2), which as their names 
suggest are both structurally and functionally relared. Mutations in any of the three enhance 

15 proteolytic processing of APP via an intracellular pathway that produces amyloid beta 
peptide or the AP peptide (or sometimes here as Abcta), a 40-42 amino acid long peptide 
that is the primary component of amyloid plaque in AD. Dysrcgulation of intracellular 
pathways for proteolytic processing may be central to the pathophysiology of AD. In the 
case of plaque formation, mutations in APP, PS 1 or PS2 consistently alter the proteolytic 

20 processing of APP so as to enhance fonnaiion of Ap 1 -42, a form of the AP peptide which 
seems to be particularly amyloidogenic, and thus very important in AD. Different forms of 
APP range in size from 695-770 amino acids, localize to the cell surface, and have a single 
C-terminal transmembrane domain. The Abeta peptide is derived from a region of APP 
adjacent to and containing a portion of the transmembrane domain. Normally, processing 

25 of APP at the a-sccretase site cleaves the midregion of the Ap sequence adjacent to the 
membrane and releases the soluble, extracellular domain of APP from the cell surface. 
This a-sectetasc APP processing, creates soluble APP- a, and it is normal and not thought 
to contribute to AD. 

Pathological processing of APP at the p- and y-secretase sites produces a vciy 
30 different result than processing at the o site. Sequential processing at the p- and 7-secretase 
sites releases the Ap peptide, a peptide possibly very inqjortant in AD pathogenesis. 
Processing at the p- and y-secrctase sites can occur in both the endoplasmic reticulum {in 
neurons) and in the cndosomal/lysosomal pathway after rcintemalization of cell surface 



55 



WO0(VI7369 



PCT/US99/2088I 



APP (in all cells). Despite intense efforts, for 10 years or more, to identify the enzymes 
responsible for processing APP at the p and y sites, to produce the Ap peptide, those 
proteases remained unknown until this disclosure. Here, for the first time, we report the 
identification and characterization of the p secrctase enzyme. We disclose some known and 
5 some novel human aspartic proteases that can act as p-secretase proteases and. for the first 
time, we explain the role these proteases have in AD. We describe regions in the proteases 
critical for their unique function and for the first time characterize their substrate. This is 
the firet description of expressed isolated purified active protein of this type, assays that use 
the protein, in addition to the identification and creation of useful cell lines and inhibitors. 

10 SUMMARY OF THE INVE^mON 

Here we disclose a number of variants of the asp2 gene and peptide. 
Any isolated or purified nucleic acid polynucleotide that codes for a protease 
capable of cleaving the beta (p) secretase cleavage site of APP that contains two or more 
sets of special nucleic acids, where the special nucleic acids are separated by nucleic acids 

15 that code for about 100 to 300 amino acid positions, where the amino acids in those 

positions may be any amino acids, where the first set of special nucleic acids consists of the 
nucleic acids that code for the peptide DTG, where the first nucleic acid of the Gist special 
set of nucleic acids is, die first special nucleic acid, and where the second set of nucleic 
acids code for either the peptide DSG or DTG, where the last nucleic acid of the second set 

20 of nucleic acids is the last special nucleic acid, with the proviso that the nucleic acids 
disclosed in SEQ ID NO. 1 and SEQ. ID NO. 5 are not included. The nucleic acid 
polynucleotide of claim 1 where the two sets of nucleic acids are separated by nucleic acids 
that code for about 125 to 222 amino acid positions, which may be any amino acids.The 
nucleic acid polynucleoude of claim 2 that code for about 150 to 172 amino acid positions. 

25 which may be any amino acids. The nucleic acid polynucleotide of claim that code for 
about 172 amino acid positions, which may be any amino acids. The nucleic acid 
polynucleotide of claim 4 where the nucleotides are described in SEQ. ID. NO. 3 The 
nucleic acid polynucleotide of claim 2 where the two sets of nucleic acids are separated by 
nucleic acids that code for about 150 to 196 amino acid positions. The nucleic acid 

30 polynucleotide of claim 6 where the two sets of nucleotides are separated by nucleic acids 
that code for about 196 amino acids (positions). The nucleic acid polynucleotide of claim 7 
where the two sets of nucleic acids are separated by the same nucleic acid sequences that 
separate the same set of special nucleic acids in SEQ. ID. NO. 5. The nucleic acid 

2 
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polynucleotide of claim 4 where the two sets of nucleic acids are separated by nucleic acids 
^ that code for about 150 to 190, amino acid (positions). The nucleic acid polynucleotide of 

claim 9 where the two sets of nucleotides are separated by nucleic acids that code for about 
1 90 amino acids (posidons). The nucleic acid polynucleotide of claim 10 where the two 
^q' 5 sets of mrcleotides are separated by the same nucleic acid sequences that separate the same 

set of special nucleotides in SEQ. ID.. NO. 1. Claims 1-11 where the fust nucleic acid of 
the first special set of amino acids, that is, the first special nucleic acid, is opcrably linked 
to any codon where the nuciic acids of that codon codes for any peptide comprising from 1 
to 10,000 amino acid (positions). The nucleic acid polynucleotide of claims 1-12 where the 
10 first special nucleic acid is opcrably linked to nucleic acid polymers that code for any 

peptide selected from the group consisting of: any any reporter proteins or proteins which 
2Q facilitate purification.Tte nucleic acid polynucleotide of claims 1-13 where the first special 

nucleic acid is opcrably linked to nucleic acid polymers that code for any peptide selected 
from the group consisting of: immunoglobin-heavy chain, maltose binding protein. 
15 glutathion S transfection. Green Fluorescent protein, and ubiquitin. Qaims 1-14 where the 

25 

last nucleic acid of the second set of special amino acids, that is, the last special nucleic 
acid, is operably linked to nucleic acid polymers that code for any peptide comprising any 
amino acids from 1 to 10,(XX) amino acids. Claims 1-15 where the last special nucleic acid 
30 is opcrably linked to any codon linked to nucleic acid polymers that code for any peptide 

20 selected from the group consisting of: any reporter proteins or proteins which facilitate 
purification. The nucleic acid polynucleotide of claims 1-16 where the first special nucleic 
acid is operably linked to nucleic acid polymers that code for any peptide selected from the 
group consisting of: immtmoglobin-heavy chain, maltose binding protein, glutathion S 
transfection. Green Fluorescent protein, and ubiquitin. 
25 Any isolated or purified nucleic acid polynucleotide that codes for a protease 

capable of cleaving the beta secretase cleavage site of AFP that contains two or more sets of 
special nucleic acids, where the special nucleic acids are separated by nucleic acids that 
code for about 100 to 3(X) amino acid positions, where the amino acids in those positions 
may be any amino acids, where the first set of special nucleic acids consists of the nucleic 

45 

30 acids that code for DTG, where the first nucleic acid of the first special set of nucleic acids 
is, the first special nucleic acid, and where the second set of nucleic acids code for either 
DSG or DTG. where the last nucleic acid of the second set of special nucleic acids is the 
50 l^t special nucleic acid, where the first special nucleic acid is operably linked to nucleic 
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acids thai code for any number of amino acids from zero to 8 1 amino acids and where each 
of those codons may code for any amino acid. The nucleic acid polynucleotide of claim 1 8 
, where the first special nucleic acid is operably linked to nucleic acids that code for any 
number of from 64 to 77 amino acids where each codon may code for any amino acid. The 
5 nucleic acid polynucleotide of claim 19 , where the first special nucleic acid is operably 
linked to nucleic acids that code for'71 amino acids. The nucleic acid polynucleotide of 
claim 20, where the first special nucleic acid is operably linked to 71 amino acids and 
where the first of those 71 amino acids is the amino acid T. The nucleic acid 
polynucleotide of claim 21, where the polynucleotide comprises a sequence that is at least 

10 95% identical to SEQ. ID. (Example 1 1). The nucleic acid polynucleotide of claim 22. 
where the complete polynucleotide comprises SEQ. ID. (Exanq)le 1 1). The nucleic acid 
polynucleotide of claim 18 . where the first special nucleic acid is operably linked to nucleic 
acids that code for any number of from 40 to 54 amino acids where each codon may code 
for any amino acid. The nucleic acid polynucleotide of claim 24, where the first special 

15 nucleic acid is operably linked to nucleic acids that code for 47 amino acids. The nucleic 
acid polynucleotide of claim 20, where the first special nucleic acid is operably linked to 47 
codons where the first those 47 amino acids is the amino acid E. The nucleic acid 
polynucleotide of claim 21 , where the polynucleotide comprises a sequence that is at least 
95% identical to SEQ. ID. (Example 10). The nucleic acid polynucleotide of claim 22, 

20 where the complete polynucleotide comprises SEQ. ID. (Example 10). 

Any isolated or purified nucleic acid polynucleotide that codes for a protease 
capable of cleaving the beta (p) secretase cleavage site of APP that contains two or more 
sets of special nucleic acids, where the special nucleic acids are separated by nucleic acids 
that code for about 100 to 300 amino acid positions, where the amino acids in those 

23 positions may be any amino acids, where the first set of special nucleic acids consists of the 
nucleic acids that code for the peptide DTG, where the first nucleic acid of the first special 
set of amino acids is, the first special nucleic acid, and where the second set of special 
nucleic acids code for either the peptide DSG or DTG, where the last nucleic acid of the 
second set of special nucleic acids, the last special nucleic acid, is operably linked to 

30 nucleic acids that code for any number of codons from SO to 170 codons. The nucleic acid 
polynucleotide of claim 29 where the last special nucleic acid is operably linked to nucleic 
acids comprising from 100 to 170 codons. The nucleic acid polynucleotide of claim 30 
where the last special nucleic acid is operably linked to nucleic acids comprising from 142 
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to 163 codons. The nucleic acid polynucleotide of claim 31 where the last special nucleic 
acid is operabty linked to nucleic acids comprising about 142 codons. The nucleic acid 
polynucleotide of claim 32 where the polynucleotide comprises a sequence that is at least 
95% identical to SEQ. ID. (Example 9 or 10). The nucleic acid polynucleotide of claim 
33, where the complete polynucleotide comprises SEQ. ID. (Example 9 or 10). The 
nucleic acid polynucleotide of claim-31 where the last special nucleic acid is opcrably 
linked to nucleic acids comprising about 163 codons. The nucleic acid polynucleotide of 
claim 35 where the polynucleotide comprises a sequence that is at least 95% identical to 
SEQ. ID. (Example 9 or 10). The nucleic acid polynucleotide of claim 36, where the 
complete polynucleotide comprises SEQ. ID. (Example 9 or 10). The nucleic acid 
polynucleotide of claim 3 1 where the last special nucleic acid is operably linked to nucleic 
acids comprising about 170 codons. Qaims 1-38 where the second set of special nucieid 
acids code for the peptide DSG, and optionally the first set of nucleic acid polynucleotide is 
operably linked to a peptide purification tag. Claims 1-39 where the nucleic acid 
polynucleotide is operably linked to a peptide purification tag which is six histidine. 
Oaims 1-40 where the &^t set of special nucleic acids are on one polynucleotide and the 
second set of special nucleic acids are on a second polynucleoti<k, where both first and 
second polynucleotides have at lease 50 codons. Claims MO where the first set of special 
nucleic acids are on one polynucleotide and the second set of special nucleic acids are on a 
second polynucleotide, where both first and second polynucleotides have at lease 50 codons 
where both said polynucleotides are in the same solution. A vector which contains a 
polynucleotide described in claims 1-42. A cell or cell line which contans a polynucleotide 
described in claims 1-42. 

Any isolated or purified peptide or protein comprising an amino acid polymer that is 
a protease capable of cleaving the beta (p) secretase cleavage site of APP that contains two 
or more sets of special amino acids, where the special amino acids are separated by about 
100 to 300 amino acid positions, where each amino acid position can be any amino acid^ 
where the fust set of special amino acids consists of the peptide DTG, where the first amino 
acid of the first special set of amino acids is, the first special amino acid, where the second 
set of amino acids is selected from the peptide comprising cither DSG or DTG, where the 
last amino acid of the second set of special amino acids is the last special amino acid, with 
the proviso that the proteases disclosed in SEQ ID NO. 2 and SEQ. ID NO. 6 are not 
included. The amino acid polypeptide of claim 45 where the two sets of amino acids air 
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separated by atwut 125 to 222 amino acid positions where in each position it may be any 
amino acid. The amino acid polypeptide of claim 46 where the two sets of amino acids are 
separated by about 150 to 172 amino acids. The amino acid polypeptide of claim 47 where 
the two sets t)f amino acids are separated by about 172 amino acids. The amino acid 
polypeptide of claim 48 where the protease is described in SEQ. ID. NO. 4 The amino acid 
polypeptide of claim 46 where the two sets of amino acids arc separated by about 150 to 
196 amino acids. The amino acid polypeptide of claim 50 where the two sets of amino 
acids are separated by about 196 amino acids. The amino acid polypeptide of claim 5 1 
where the two sets of amino acids arc separated by the same amino acid sequences that 
separate the same set of special amino acids in SEQ. ID. NO. 6. The amino acid 
polypeptide of claim 46 where the two sets of amino acids are separated by about 1 50 to 
190, amino acids. The amino acid polypeptide of claim 53 where the two sets of 
nucleotides arc separated by about 190 amino acids. The amino acid polypeptide of claim 
54 where the two sets of nucleotides are separated by the same amino acid sequences that 
separate the same set of special amino acids in SEQ. ID. NO. 2. Claims 45-55 where the 
first amino acid of the first special set of amino acids, that is, the first special amino acid, is 
operably linked to any peptide comprising from 1 to 10.000 amino acids. The amino acid 
polypeptide of claims 45-56 where the fust special amino acid is operably linked to any 
peptide selected from the group consisting of: any any reporter proteins or proteins which 
facilitate purification. The amino acid polypeptide of claims 45-57 where the first special 
amino acid is operably linked to any peptide selected from the group consisting of: 
iramunoglobin-heavy chain, maltose binding protein, giutathion S transfection. Green 
Fluorescent protein, and ubiquitin. Qaims 45-58. where the last amino acid of the second 
set of special amino acids, that is, the last special amino acid, is operably linked to any 
peptide comprising any amino acids from 1 to 10,000 amino acids. Claims 45-59 where the 
last special amino acid is operably linked any peptide selected from the group consisting of 
any reporter proteins or proteins which facilitate purification. The amino acid polypeptide 
of claims 45-60 where the first special amino acid is operably linked to any peptide selected 
from the group consisting of: immunoglobtn-heavy chain, maltose binding protein, 
giutathion S transfection. Green Fluorescent protein, and ubiquitin. 

Any isolated or puriHed peptide or protein comprising an amino acid polypeptide 
that codes for a protease capable of cleaving the beta secrelase cleavage site of APP that 
contains two or more sets of special amino acids, where the special amino acids are 
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separated by about 100 to 300 amino acid positions, where each amino acid in each position 
can be any amino acid, where the first set of special amino acids consists of the amino acids 
DTG, where the first amino acid of the first special set of amino acids is. the first special 
amino acid, D. and where the second set of amino acids is either DSG or DTG, where the 
last amino acid of the second set of special amino acids is the last special amino acid, G, 
where the first special amino acid is operably linked to amino acids that code for any 
number of amino acids from zero to 81 amino acid positions where in each position it may 
be any amino acid. The amino acid polypeptide of claim 62, where the fitst special amino 
^id is operably linked to a peptide from about 64 to 77 amino acids positions where each 
amino acid position may be any amino acid. The amino acid polypeptide of claim 63, 
where the first special amino acid is operably linked to a peptide of 71 amino acids. The 
amino acid polypeptide of claim 64, where the first special amino acid is operably linked to 
71 amino acids and the first of those 7 1 amino acids is the amino acid T. The amino acid 
polypeptide of claim 65, where the polypeptide comprises a sequence that is at least 95% 
identical to SEQ. ID. (Exany)le 11). The amino acid polypeptide of claim 66. where the 
complete polypeptide comprises SEQ. ID. (Example 1 1 ). The amino acid polypeptide of 
claim 62, where the first special amino acid is operably linked to any number of from 40 to 
54 amino acids (positions) where each amino acid position may be any amino acid. The 
amino acid polypeptide of claim 68, where the first special amino acid is operably linked to 
amino acids that code for a peptide of 47 amino acids. The amino acid polypeptide of claim 
69, where the first special amino acid is operably linked to a 47 amino acid peptide where 
the first those 47 amino acids is the amino acid E. The amino acid polypeptide of claim 70, 
where the polypeptide comprises a sequence that is at least 95% identical to SEQ. ID. 
(Example 10). The amino acid polypeptide where the polypeptide comprises Example 10). 

Any isolated or purified amino acid polypeptide that is a protease capable of 
cleaving the beta (p) secrctasc cleavage site of APP that contains two or more sets of 
special amino acids, where the special amino acids arc separated by about 100 to 300 amino 
acid positions, where each amino acid in each position can be any amino acid, where the 
first set of special amino acids consists of the amino acids that code for DTG, where the 
first amino acid of the first special set of amino acids is, the first special amino acid, D, and 
where the second set of amino acids are either DSG or DTG, where the last amino acid of 
the second set of special amino acids is the last special amino acid, G, which is operably 
linked to any number of amino acids fipom 50 to 170 amino acids, which may be any amino 
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acids. The amino acid polypcpiide of claim 73 where the last special amino acid is 
* operably linked to a peptide of about 100 to 170 amino acids. The amino actd polypeptide 

of claim 74 where the last special amino acid is operably linked to to a peptide of about 142 
to 163 amiBO acids. The amino acid polypeptide of claim 75 where the last special amino 
5 acid is opexably linked to to a peptide of about about 142 amino acids. The amino acid 
polypeptide of claim 76 where the polypeptide comprises a sequence that is at least 95% 
identical to SEQ, ID. (Example 9 or 10). The amino acid polypeptide of claim 75 where 
the last special amino acid is operably linked to a peptide of about 163 amino acids. The 
amino acid polypeptide of claim 79 where the polypeptide comprises a sequence that is at 
10 least 95% identical to SEQ. ID. (Example 9 or 10). The amino acid polypeptide of claim 
79, where the complete polypeptide comprises SEQ. ID. (Example 9 or 10). The amino 
2Q acid polypeptide of claim 74 where the last special amino acid is operably linked to to a 

peptide of about 170 amino acids. Claim 46-8 1 where the second set of special amino acids 
is comprised of the peptide with the amino acid sequence DSG. Claims 45-82 where the 
IS amino acid polypeptide is operably linked to a peptide purification tag. Claims 45-83 
where the amino acid polypeptide is operably linked to a peptide puriflcation tag which is 
six histidine. Claims 45-84 where the first set of special amino acids are on one 
polypeptide and the second set of special amino acids are on a second polypeptide, where 
2Q both first and second polypeptide have at lease 50 amino acids, which may be any amino 

20 acids. Claims 45-84 where the first set of special amino acids are on one polypeptide and 
the second set of special amino acids are on a second polypeptide, where both fiist and 
second polypeptides have at lease 50 amino acids where both said polypeptides are io the 
same vessel. A vector which contains a polypeptide described in claims 45-86. A cell or 
cell line which contans a polynucleotide described in claims 45-87. The process of making 
25 any of the polynucleotides, vectors, or cells of claims 1-44. The process of making any of 
the polypeptides, vectors or cells of claims 45-88. Any of the polynucleotides, 
polypeptides, vectors, cells or cell lines described in claims 1-88 made from the processes 
described in claims 89 and 90. 

Any isolated or purified peptide or protein comprising an amino acid polypeptide 

45 

30 that codes for a protease capable of cleaving the beta secretase cleavage site of APP that 
contains two or more sets of special amino acids, where the special amino acids are 
separated by about 100 to 300 amino acid positions, where each amino acid in each position 
5Q can be any amino acid, where the first set of special amino acids consists of the amino acids 

S 
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DTG, where the first amino acid of tfac first special set of amino acids is, the first special 
amino acidt, D, and where the second set of amino acids is either DSG or DTG. where the 
last amino acid of the second set of special amino acids is the last special amino acid. G. 
where the first special amino acid is operably linked to amino acids thai code for any 
nun^r of amino acids from zero to 81 amino acid positions where in each position it may 
be any amino acid. 

The amino acid polypeptide of claim 62. where the first special amino acid is 
operably linked to a peptide from about 30 to 77 amino acids positions where each amino 
acid position may be any amino acid. The amino acid polypeptide of claim 63. where the 
first special amino acid is operably linked to a peptide of 35, 47. 71. or 77 amino acids. 

The amino acid polypeptide of claim 63, where the fust special amino acid is 
operably linked to the same corresponding peptides from SEQ. ID. NO. 3 that arc 35, 47, 
71, or 77 peptides in length, beginning counting with the amino acids on the first special 
sequence, DTG, towards the N-lerminal of SEQ. ID. NO. 3. 

The amino acid polypeptide of claim 65, where the polypeptide comprises a 
sequence that is at least 93% identical to Uie same corresponding amino acids in SEQ. ID. 
NO. 4, that is. identical to that portion of the sequences in SEQ.ID. NO. 4, including all the 
sequences from both the first and or the second special nucleic acids, toward the N- 
terminal, through and including 71. 47. 35 amino acids before the first special amino acids. 
(Examples 10 and 11). 

The amino acid polypeptide of claim 65. where the complete polypeptide comprises 
the peptide of 71 amino acids, where the first of the amino acid is T and the second is Q. 
The nucleic acid polynucleotide of claim 21, where the polynucleotide comprises a 
sequence that is at least 95% identical to the same corresponding amino acids in SEQ. ID. 
NO. 3, that is. identical to the sequences in SEQ. ID. NO. 3 including the sequences from 
both die first and or the second special nucleic acids, toward the N-Tcrminal, through and 
including 71 amino acids, see Example 10, beginning from the DTG site and including the 
nucleotides from that code for 71 amino acids). 

The nucleic acid polynucleotide of claim 22, where the complete polynucleotide 
comprises identical to the same corresponding amino acids in SEQ. ID. NO. 3, that is, 
identical to the sequences in SEQ. ID. NO. 3 including the sequences from both the first 
and or the second special nucleic acids, toward the N-Tertninal, through and including 7 1 
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amino acids, see Example 10. beginning from the DTG site and including the nucleolides 
from that code for 71 amino acids). 

The nucleic acid polynucleotide of claim 18 , where the first special nucleic acid is 
operably linked to nucleic acids that code for any number of from about 30 lo 54 amino 
acids where each codon may code for any amino add. 

The nucleic acid polynucleotide of claim 20, where the first special nucleic acid is 
operably linked to 47 codons where the first those 35 or 47 amino acids is the amino acid E 
orG. 

The nucleic acid polynucleotide of claim 21, where the polynucleotide comprises a 
sequence that is at least 95% identical to the same corresponding amino acids in SEQ. ID. 
NO. 3. that is, identical to that portion of the sequences in SEQ. ID. NO. 3 including the 
sequences from both the first and or the second special nucleic acids, toward the N- 
Tcrminal, through and including 35 or 47 amino acids, see Example 1 1 for the 47 example, 
beginning from the DTG site and including the nucleotides from that code for the previous 
35 or 47 amino acids before the DTG site). The nucleic acid polynucleotide of claim 22, 
where the polynucleotide comprises identical to the same corresponding amino acids in 
SEQ. ID. NO. 3, that is, identical to the sequences in SEQ. ID. NO. 3 including the 
sequences from both the fust and or the second special nucleic acids, toward the N- 
Terminal, through and including 35 or 47 amino acids, see Exan^le 11 for the 47 example, 
beginning from the DTG site and including the nucleotides from that code for the previous 
35 or 47 amino acids before the DTG site). 

An isolated nucleic acid molecule comprising a polynucleotide, said polynucleotide 
encoding a Hu-Asp polypeptide and having a nucleotide sequence at least 95% identical to 
a sequence selected from the group consisting of: 

(a) a nucleotide sequence encoding a Hu-Asp polypeptide selected from 
the group consisting of Hu-Aspl, Hu-Asp2{a), and Hu-Asp2(b), wherein said Hu-Aspl. 
Hu-Asp2(a) and Hu-Asp2(b) polypeptides have the complete amino acid sequence of SEQ 
ID No. 2, SEQ ID No. 4, and SEQ ID No. 6. respectively; and 

(b) a nucleotide sequence complementary to the nucleotide sequence 

of (a). 

The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is Hu- 
AspU and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of SEQ 
ID No. 1. The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is Hu- 

10 
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Asp2(a), and said polynudeoUde molecule of 1(a) comprises the nucleoiide scqueocc of 
SEQ [D No. 4. The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is 
Hu-Asp2(b), and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of 
SEQ ID No. 5. Aa isolated nucleic acid molecule comprising polynucleotide which 
hybridizes under stringent conditions to a polynucleotide having the nucleotide sequence in 
(a) or (b) of claim 92. A vector coinprising the nucleic acid molecule of claim 96. The 
vector of claim 97, wherein said nucleic acid molecule is operably linked to a promoter for 
the expression of a Hu-Asp polypeptide. The vector of claim 98, wherein said Hu-Asp 
polypeptide is Hu-Asp 1 . The vector of claim 98, wherein said Hu-Asp polypeptide is Hu- 
Asp2(a). The vector of claim 98, wherein said Hu-Asp polypeptide is Hu-Asp2(b). A host 
cell comprising the vector of claim 98. A method of obtaining a Hu-Asp polypeptide 
comprising culturing the host cell of claim 102 and isolating said Hu-Asp polypeptide. An 
isolated Hu-Asp 1 polypeptide comprising an amino acid sequence at least 95% identical to 
a sequence comprising the amino acid sequence of SEQ ID No. 2. An isolated Hu-Asp2(a) 
polypeptide coinprising an amino acid sequence at least 95% identical to a sequence 
comprising the amino acid sequence of SEQ ID No. 4. An isolated Hu-Asp2(a) polypeptide 
comprising an amino acid sequence at least 95% identical to a sequence comprising the 
amino acid sequence of SEQ ID No. 8. An isolated antibody that binds specifically to the 
Hu-Asp polypeptide of any of claims 104-107. 

Here we disclose numerous methods to assay the enzyme. 

A method to identify a cell that can be used to screen for inhibitors of ^ 
secretase activity comprising: 

(a) identifying a cell that expresses a protease capable of cleaving APP at the P 
secretase site, comprising: 

i) collect the cells or the supemantent from the cells to be identified 

ii) measure the production of a crirical peptide, where the critical 
peptide is selected from the group consisting of either the APP C- 
tcrminal peptide or soluble APP, 

iii) select the cells which produce the critical peptide. 

The method of claim 108 where the cells arc collected and the critical peptide is the 
APP C-tcrminal peptide cttated as a result of the p secretase cleavage. The method of claim 
108 where the supemantent is collected and the critical peptide is soluble APP where the 
soluble APP has a C-terminal created by p secretase cleavage. The method of claim 108 

11 
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where the cells contain any of the nucleic acids or polypeptides of claims 1-86 and where 
the cells are shown to cleave the p secretase site of any peptide having the following 
peptide structure, P2, PI, PI', P2', where P2 is K or N, where PI is M orL, where PI* is 
D, where P2- is A. The method of claim 1 1 1 where P2 is K and PI is M.. The method of 
5 claim 1 12 where P2 is N and PI is L. 

Any bacterial cell comprising any nucleic acids or peptides in claims 1-86 
and 92- 107. A bacterial cell of claim 1 14 where the bacteria is £ colL Any eukaryotic cell 
comprising any nucleic acids or polypeptides in claims 1-86 and 92-107. 

Any insect cell comprising any of the nucleic acids or polypeptides in claims 

to 1-86 and 92-107. A insect cell of claim 1 17 where the Insect is sf9, or High 5. A insect 
cell of claim 100 where the insect cell is High 5. A mammalian cell comprising any of the 
nucleic acids or polypeptides in claims 1-86 and 92-107. A mammalian cell of claim 120 
where the mammalian cell is selected from the group consisting of, human, rodent, 
lagomoiph, and primate. A mammalian cell of claim 121 where the mammalian cell is 

IS selected from the group consisting of human cell. A mammalian cell of claim 122 where 
the human cell is selected from the group comprising HEK293, and IMR-32. A 
mammalian cell of claim 121 where the cell is a primate ceil. A primate cell of claim 124 
where the primate cell is a COS-7 cell. A mammalian cell of claim 121 where cell is 
selected from a rodent cells. A rodent cell of claim 126 selected from, CHO-Kl, Neuro- 

20 2A. 3T3 cells. A yeast cell of claim 1 15. An avian cell of claim 1 15. 

Any isoform of APP where the last two carboxy terminus amino acids of that 
isoform are both lysine residues. In written dcscrip. Dcfmc isoform is any APP 
polypeptide, including APP variants (including mutations), and APP fragments that exists 
in humans such as those desribed in US 5,766,846, col 7, lines 45-67, incorporated into this 

25 document by reference. The isoform of APP from claim 1 14, comprising the isoform 
known as APP695 modified so that its last two having two lysine residues as its last two 
carboxy terminus amino acids. The isoform of claim 130 comprising SEQ. ID. 16. The 
isoform variant of claim 130 comprising SEQ. ID, NO. 18. and 20. Any eukaryotic cell 
line, comprising nucleic acids or polypeptides of claim 130-132. Any cell line of claim 133 

30 that is a mammaliam cell line (HEK293, Neuro2a. best - plus others. A method for 
identifying inhibitors of an enzyme that cleaves the beta secretase cleavabe site of APP 
comprising: 
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8} cultuhng cells in a culture medium under conditions to which the enzyme 
causes processing of APP and release of amyloid beta-peptidc into the medium and causes 
the accumulation of CTF99 fragments of APP in cell lysates. 

b) • - exposing the cultured cells to a test compound; and specifically determining 
whether the test compound inhibits the function of the enzyme by measuring the amount of 
amyloid bcia-pcptide released into the medium and or the amount of CTF99 fragments of 
APP in cell lysates; 

c) identifying test compounds diminishing the amount of soluble amyloid beta 
peptide present in the culture medium and diminution of CTP99 fragments of APP in cell 
lysates as Asp2 inhibitors. 

The method of claim 135 wherein the cultured cells are a human, rodent or insect 
cell line. The method of claim 136 wherein the human or rodent cell line exhibits P 
secretase activity in which processing of APP occurs with release of amyloid beta-pcptide 
into the culture medium and accumulation of CTF99 in cell lysates. A method as in claim 
137 wherein the human or rodent cell line treated with the antisensc oligomers directed 
against the enzyme that exhibits ^ secretase activity, reduces release of soluble amyloid 
beta-peptide into the culture medium and accumulation of CTF99 in celt lysates. A method 
for the identification of an agent that decreases the activity of a Hu-Asp polypeptide 
selected from the group consisting of Hu-Asp 1. Hu-Asp2(a). and Hu-Asp2(b). the method 
comprising: 

a) determining the activity of said Hu-Asp polypeptide in the presence of a test 
agent and in the absence of a test agent; and 

b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide 
determined in the absence of said test agent; 

whereby a lower level of activity in the presence of said test agent than in the absence of said 
test agent indicates that said test agent has decreased the activity of said Hu-Asp 
polypeptide..The nucleic acids, peptides, proteins, vectors, cells and cell lines, and assays 
described herein. 

The present invention provides isolated nucleic acid molecules comprising a 
polynucleotide that codes for a polypeptide selected from the group consisting of human 
aspartyl proteases. In particular, human aspaityl protease 1 (Hu-Asp 1) and two alternative 
si^ce variants of human aspartyl protease 2 (Hu-A5p2). designated herein as Hu-A5p2(a) and 
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Hu-Asp2(b). As used herein, alt references to "Hu-Asp" should be understood to refer to all of 
^ Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b). In addition, as used herein, all references to "Hu- 

Asp2" should be understood to refer to both Hu-Asp2(a) and Hu-Asp2(b). Hu-Asp 1 is 

expressed most ^undantly ia pancreas and prostate tissues, while Hu-Asp2(a) and Hu- 
jq' s Asp2(b) BR expressed most abundantly in pancreas and brain tissues. Tbe invention also 

provides isolated Hu-Asp I, Hu-Asp2Ca), and Hu-Asp2(b) polypeptides, as well as fragments 

thereof which exhibit aspartyl protease activity. 

In a preferred embodiment, the nucleic acid molecules comprise a polynucleotide 

having a nucleotide sequence selected from tbe group consisting of residues I -1554 of SEQ 
10 ID NO:U encoding Hu-Aspl. residues 1-1503 of SEQ K> NO:3, encoding Hu-Asp2(a), and 

residues 1-1428 of SEQ ID NOtS, encoding Hu-Asp2(b). In another aspect, the invention 
20 provides an isolated nucleic acid molecule comprising a polynucleotick which hybridizes 

under stringent conditions to a polynucleotide encoding Hu-Aspl, Hu-Asp2(a). Hu-Asp-2(b). 

or fragments thereof. European patent application EP 0 848 062 discloses a polypeptide 
15 referred to as "Asp I," that bears substantia] homology to Hu-Aspl, while international 

25 

application WO 98/22597 discloses a polypeptide rcfisrrcd to as "Asp 2," that bears substantial 
homology to Hu-Asp2(a). 

The present invention also provides vectors comprising tbe isolated nucleic acid 
30 molecules of the invention, host cells into which such vectors have been introduced, and 

20 recombinant methods of obtaining a Hu-Aspl. Hu-A5p2(a}. or Hu-Asp2(b) polypeptide 
comprising cultunng the above-described host cell and isolating the relevant polypeptide. 

In another aspect, the invention provides isolated Hu-Aspl, Hu-Asp2(a), and Hu- 
Asp2(b) polypeptides, as well as fragments thereof. In a preferred embodiment, the Hu-Asp I, 
Hu-Asp2(a), and Hu-Asp2(b) polypeptides have the amino acid sequence given in SEQ ID 
25 N0:2, SEQ ID NO:4, or SEQ ID N0:6, respectively. The present invention also desciibes 
4Q active forms of Hu-Asp2, methods for preparing such active forms, methods for preparing 

soluble forms, methods for measuring Hu-Asp2 activity, and substrates for Hu- Asp2 cleavage. 
■ The invention also describes antisense oligomers targeting the Hu-Aspl , Hu-Asp2(a) and Hu- 
Asp2(b) mRNA transcripts and the use of such antisense reagents to decrease such mRNA 

45 

30 and consequently the production of the corresponding polypeptide. Isolated antibodies, both 
polyclonal and monoclonal, that binds specifically to any of the Hu-Aspl, Hu-A5p2(a), and 
Hu-Asp2Cb) polypeptides of the invention arc also provided. 

50 
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The invention also provides a method for the identification of an agent that modulates 
the activity of any of Hu-Asp-l. Hu-Asp2(a). and Hu-Asp2Cb). The inventions describes 
methods to test such agents in cell-free assays to which Hu-Asp2 polypeptide is added, as well 
as methods to test such agents in human or other mammalian cells in which Hu-Asp2 is 
5 present. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTINGS 

Sequence ID No. 1 — ^Human Asp-1, nucleotide sequence 
Sequence ID No. 2 — ^Human Asp- 1, predicted amino acid sequence 
Sequence ID No. 3 — ^Human Asp-2(a). nucleotide sequence 

10 Sequence ID No. A — ^Human Asp-2(a), predicted amino acid sequence 
Sequence ID No. 5 — ^Human Asp-2<b), nucleotide sequence 
Sequence ID No. 6 — Human Asp-2(b), predicted amino acid sequence • 
Sequence ID No. 7 — Murine Asp-2(a), nucleotide sequence 
Sequence ID No. 8 — Murine Asp- 2(a), predicted amino acid sequence 

15 Sequence ID No. 9 — Human APP695, nucleotide sequence 

Sequence ID No. 10 — ^Human APP695, predicted amino acid sequence 
Sequence ID No. II — Human APP695-Sw, nucleotide sequence 
Sequence ID No. 12 — Human APP695-Sw. predicted amino acid sequence 
Sequence ID No. 13 — Human APP695-VF, nucleotide sequence 

20 Sequence ID No. 14 — Human APP695-VF. predicted amino acid sequence 
Sequence ID No. 15 — Human APP695-KK, nucleotide sequence 
Sequence ID No. 16 — Human APP695-KK, predicted amino acid sequence 
Sequence ID No. 17 — Human APP695-Sw-K:IC nucleotide sequence 
Sequence ID No. 1 8 — Human APP695-S w-KK, predicted amino acid sequence 

25 Sequence ID No. 1 9 — Human APP695-VF-KK, nucleotide sequence 

Sequence ID No.20 — Human APP695-VF-KK, predicted amino acid sequence 
Sequence ID No,21 — T7-Hunian-pro-Asp-2(a)ATM. nucleotide sequence 
Sequence ID No.22 — T7-Human-pro-Asp-2(a)ATM, amino acid sequence 
Sequence ID No.23 — T7-Caspase-Human-pro-Asp-2(a)ATM, nucleotide sequence 

30 Sequence ID No.24~ T7-Caspase-Human-pro-Asp-2(a)ATM, amino acid sequence 
Sequence ID No.25 — Human-pro-Asp-2(a)ATM (low GQ, nucleotide sequence 
Sequence ID No.26 — Human-pro-Asp-2(a)ATM, (low GC), amino acid sequence 
Sequence ID No-27 — T7-Caspase-Caspase 8 cleavage-Human-pro-Asp-2(a)ATM, 
nucleotide sequence 

35 Sequence ED NoJ28 — T7-Caspase-Caspase 8 clcavage-Human-pro-Asp-2(a)ATM, amino 
acid sequence 

Sequence ID No.29 — Human Asp-2(a)ATM, nucleotide sequence 
Sequence ID No.30 — Human Asp-2(a)ATM, amino acid sequence 
Sequence ID No.3l — Human Asp-2(a)ATM(His)6 . nucleotide sequence 
40 Sequence ID No.32 — Human Asp-2(a)ATM(His)6, amino acid sequence 

Sequence ID No.5 33-46 are described below in the Detailed Description of the Invention. 



BRIEF DESCRIPTION OF THE FIGURES 
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Rgure 1: Figure 1 shows the nucleotide (SEQ ID NO: I) and predicted amino 
acid sequence (SEQ ID N0:2) of human Asp 1 . 

Figure 2: Figure 2 shows the nucleotide (SEQ ID N0:3) and predicted amino 
add sequoice (SEQ ID N0:4) of human Asp2(a). 

Figure 3: Figure 3 shows the nucleotide (SEQ ID NO:5) and predicted amino 
add sequence (SEQ ID NO:6) of human Asp2(b). The predicted transmembrane domain of 
Hu-Asp2(b) is cndosed in brackets. 

Figure 4: Figure 4 shows the nucleotide (SEQ ID No. 7) and predicted amino 
acid sequence (SEQ ID No. 8) of murine Assp2(a) 

Figure 5: Figure 5 shows the BestFit alignment of the predicted amino acid 
sequences of Hu-Asp2(a) and murine Asp2(a) 

Figure 6: Figure 6 shows the nucleotide (SEQ ID No. 21) and predicted amino 
acid sequence (SEQ ID No. 22) of T7-Human-pro-Asp-2(a)ATM 

Figure?: Figure 7 shows the nucleotide (SEQ ID No. 23) and predicted amino 
add sequence (SEQ ID No. 24) of T7-caspase-Human-pro-Asp-2(a)ATM 

Figure 8: Figure 8 shows the nucleotide (SEQ ID No. 25) and predicted amino 
acid sequence (SEQ ID No. 26) of Human-pro- Asp-2(a)ATM (low GC) 

Figure 9: Western blot showing reduction of CTF99 production by HEK 125.3 
cells transfected with antisense oligomers targeting the Hu-Asp2 Mma 

Figure 10: Western blot showing increase in CrF99 production in mouse 
Neuro-2a cells cotransfected with APP-KK with and without Hu-Asp2 only in those cells 
cotransfected with Hu-Asp2. A further increase in CTF99 production is seen in cells 
cotransfected with APP-Sw-KK with and without Hu-Asp2 only in those cells cotransfected 
with Hu-Asp2 

Figure 1 1: Figure 1 1 shows the predicted amino acid sequence (SEQ ID No. 30) 
of Human- Asp2(a)ATM 

Figure 12: Figure 1 1 shows the predicted amino acid sequence (SEQ ID No. 30) 
of Human- Asp2(a)ATM(His)6 
DETAILED DESCRIPTION OF THE INVENTION 

A few definitions used in this invention follow, most definitions to be used are those 
that would be used by one ordinarily skilled in the art 

When the p amyloid peptide any peptide resulting from beta secretase cleavage of 
APP. This includes, peptides of 39. 40. 41, 42 and 43 amino acids, extending from the p- 
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secietase cleavage site to 39. 40. 41, 42 and 43 amuio acids, p amyloid pqnidc also means 
sequences 1-6. SEQ. ID. NO. 1-6 of US 5,750349, issued 12 May 1998 (incorporated into 
this document by reference). A p-sccretasc cleavage fragment disclosed here is called CTF- 
99, which extends from p-sccrctasc cleavage site to the carboxy terminus of APP. 

When an isofonn of APP is discussed then what is meant is any APP polypeptide, 
including APP variants (including mutations), and APP fragments that exists in humans 
such as those desiibed in US 5,766,846, col 7, lines 45-67, incorporated into this document 
by reference and see below. 

The term "^amyloid precursor protein" (APP) as used herein is defined as a 
polypeptide that isxncoded by a gene of the same name localized in humans on the long 
arm of chromosome 21 and that includes "PAP - here "p-amyloid protein" see above, 
within its carboxyl third. APP is a glycosylated, single-membrane spanning protein 
expressed in a wide variety of cells in many mammaliam tissues. Examples of specific 
isotypes of APP which are currently known to exist in himians are the 695-amino acid 
polypeptide described by Kang ct. al. (1987) Nature 325:733-736 which is designated as the 
"normal** APP; the 751-amino acid polypeptide described by Ponte et al. (1988) Nature 
331:525-527 (1988) and Tanzi et al. (1988) Nature 33 1:528-530; and the 770-araino acid 
polypeptide described by Kitaguchi eL al. (1988) Nature 331:530-532. Examples of specific 
variants of APP include point mutation which can differ in both position and phcnotypc (for 
review of known variant mutation see Hardy (1992) Nature Genet. 1:233-234). All 
references cited here incorporated by reference. The term "APP fragments" as used herem 
refers to fragments of APP other than those which consist solely of pAP or pAP fragments. 
That is, APP fragments will include amino acid sequences of APP in addition to those 
which form intact 3 AP or a fragment of pAP. 

When the term "any amino acid" is used, the amino acids referred to are to be selected 
from the following, three letter and single letter ^breviations - which may also be used, are 
provided as follows: 

Alanine, Ala, A; Arginine, Arg. R; Asparagine, Asn, N; Aspartic acid. Asp, D; 
Cystein. Cys. C; Glutamine, Gin, Q; lu;E-Giutamic Acid, Glu, E; Glycine, Gly, G; 
Histidine. His, H; Isolcucine. lie. I; Leucine, Leu, L; Lysine, Lys, K; Methionine, Met, 
M; Phenylalanine, Phe, F; Proline, Pro, P; Serine, Scr.S; Threonine, Thr, T; Tryptophan. 
Trp, W; Tyrosine, Tyr, Y; Valine, Val, V; Aspartic acid or Asparagine, Asx, B; Glutamic 
acid or Glutamine. Glx. Z; Any amino acid, Xaa, X.. 
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The present invention describes a method to scan gene databases for the simple 
^ active site motif characteristic of aspartyl proteases. Eukaiyotic aspartyl proteai^es such as 

pepsin and renin possess a two-domain structure which folds to bring two aspartyl residues 
into proximity within the active site. These are embedded in the short iripcptide motif 
5 DTG. or more rarely, DSG. Most aspartyl proteases occur as proenzyme whose N-terminus 
must be cleaved for activation. The DTG or DSG active site motif appears at about residue 
65-70 in the proenzyme (prorenin, pepsinogen), but at about residue 25-30 in the active 
enzyme alter cleavage of the N-terminal prodomain. The limited length of the active site 
motif makes it difficult to search collections of short, expressed sequence tags (EST) for 
10 novel aspartyl proteases. EST sequences typically average 250 nucleotides or less, and so 
would encode 80-90 amino acid residues or less. That would be too short a sequence to 
20 span the two active site motifs. The preferred method is to scan databases of hypothetical 

or assembled protein coding sequences. The present invention describes a computer 
method to identify candidate aspartyl proteases in protein sequence databases. The method 
15 was used to identify seven candidate aspartyl protease sequences in the Caenorhabditis 

25 

eiegans genome. These sequences were then used to identify by homology search Hu-Aspl 
and two alternative splice variants of Hu-Asp2. designated herein as Hu-Asp2(a) and Hu- 
Asp2{b). 

In a major aspect of the invention disclosed here we provide new information about 
20 APP processing. Pathogeneic processing of the amyloid precursor protein (APP) via the 

pathway requires the sequential action of two proteases referred to as ^-secretase and y- 
secretasc. Cleavage of APP by the p-secrctasc and -y-secretase generates the N-cenninus 

35 

and C-terminus of the Ap peptide, respectively. Because over production of the Ap 
peptide, particularly the APmi, has been implicated in the initiation of Alzheimer's disease, 
25 inhibitors of either the ^secrctase and/or the 7-secrctase have potential in the treatment of 
^0 Alzheimer's disease. Despite the importance of the P-sccrctase and T^secrctasc in the 

pathogenic processing of APP, molecular definition of these enzymes has not been 
accomplished to date. That is. it was not kiKswn what enzymes were required for cleavage 
at either the ^sccrctasc or the y-secrctase cleavage site. The sites themselves were 
30 icnown because APP was known and the ApM2. peptide was known, see US 5.766.846 and 
US 5,837,672, (incorporated by reference, with the exception to reference to "soluble" 
peptides). But what enzyme was involved in producing the A^j^:* peptide was unlutown. 

50 
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The present invention involves the molecular definition of several novel human 
fispartyl proteases and one of these, referred to as Hu-Asp-2(a) and Hu-Asp2(b), has been 
characterized in detail. Previous forms of asp I and asp 2 have been disclosed, see EP 
0848062 A2and EP 0855444A2, inventors David Powe! et. aJ., assigned to Smith Kline 
Beecham Corp. (incorporated by reference). Herein are disclosed old and new forms of 
Hu-Asp 2. For the first time they art expressed in active form, their substrates arc 
disclosed, and their specificity is disclosed. Prior to this disclosure cell or cell extracts were 
required to cleave the P-secretase site, now purified protein can be used in assays, also 
described here. Based on the results of (1) aniisensc knock out experiments. (2) transient 
iransfection knock in experiments, and (3) biochemical experiments using purified 
recombinant Hu-Asp-2, we demonstrate that Hu-Asp-2 is the P-secretase involved in the 
processing of APP. Although the nucleotide and predicted amino acid sequence of Hu- 
Asp-2(a) has been reported, see above, see EP 0848062 A2 and EP 0855444A2, no 
functional characterization of the enzyme was disclosed. Here the authors characterize the 
Hu-Asp-2 enzyme and are able to explain why it is a critical and essential enzyme required 
in the formation of Api^s, peptide and possible a critical step in the development of AD. 

In another embodiment the present invention also describes a novel splice variant of 
Hu-Asp2, referred to as Hu-Asp-2(b), that has never before been disclosed. 

In another embodiment, the invention provides isolated nucleic acid molecules 
comprising a polynucleotide encoding a polyp^tide selected from the group consisting of 
human aspaityl protease 1 (Hu-Asp 1) and two alternative splice variants of human aspartyl 
protease 2 (Hu-Asp2), designated herein as Hu-Asp2(a) and Hu-Asp2(b). As used herein, aU 
references to **Hu-Asp2" should be understood to refer to both Hu-Asp2(a) and Hu-Asp2(b). 
Hu-Asp 1 is expressed most abundantly in pancreas and prostate tissues, while Hu-Asp2(a) 
and Hu-Asp2(b) are expressed most abundantly in pancreas and brain tissues. The invention 
also provides isolated Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b) polypeptides, as wcU as 
firagments thereof which exhibit aspartyl protease activity. 

The predicted amino acid sequences of Hu-Asp 1, Hu-Asp2(a) and Hu-Asp2{b) share 
significant homology with f»:eviously identified mammalian aspairyl proteases such as 
pepsinogen A. pepsinogen B. cathcpsin D, caihepsin E, and renin. P3.Szecs, Scand. J. Clin. 
Lab. Invest. 52:(SuppI. 210 5-22 (1992)). These enzymes are characterized by the presence of 
a duplicated DTG/DSG sequence motif. The Hu-Asp 1 and HuAsp2 polypeptides disclosed 
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herein also exhibit extremely high homology with the ProSite consensus motif for asparty] 
proteases extracted from the SwissProt database. 

The nucleotide sequence given as residues 1-1554 of SEQ ID N0;1 corresponds to 
the nucleotide sequence encoding Hu-Aspl. the nucleotide sequence given as residues 1-1503 
of SEQ ID NO; 3 corresponds to the nucleotide sequence encoding Hu-Asp2(a), and the 
nucleotide sequence given as residues 1-1428 of SEQ ID N0:5 corresponds to the nucleotide 
sequence encoding Hu-Asp2(b). The isolation and sequencing of DNA encoding Hu-Aspl, 
Hu-Asp2(a), and Hu-Asp2Cb) is described below in Examples 1 and 2. 

As is described in Examples 1 and 2. automated sequencing methods were used to 
obtain the nucleotide sequence of Hu-Aspl, Hu-Asp2(a), and Hu-Asp-2{b). The Hu-Asp 
nucleotide sequences of the present invention were obtained for both DNA strands, and are 
believed to be 100% accurate. However, as is known in the an, nucleotide sequence obtained 
by such automated methods may contain some errors. Nucleotide sequences determined by 
automation are typically at least about 90%, more typically at least about 95% to at least about 
99.9% idendcal to the actual nucleotide sequence of a given nucleic acid molecule. The 
actual sequence may be more precisely determined using manual sequencing methods, which 
arc well known in the art. An error in sequence which results in an insertion or deletion of 
one or more nucleotides may result in a frame shift in translation such that the predicted 
amino acid sequence will differ from that which would be predicted from the actual 
nucleotide sequence of the nucleic acid molecule, starting at the point of the mutation. The 
Hu-Asp DNA of the present invention includes cDNA, chemically synthesized DNA, DNA 
isolated by PGR, genomic DNA, and combinations thereof. Genomic Hu-Asp DNA may be 
obtained by screening a genomic hbraiy with the Hu-Asp2 cDNA described herein, using 
methods that are well known in the art, or with oligonucleotides chosen fix)m the Hu-Asp2 
sequence that will prime the polymerase chain reaction (PGR). RNA transcribed from Hu- 
Asp DNA is also encompassed by the present invention. 

Due to the degeneracy of the genetic code, two DNA sequences may differ and yet 
encode identical amino acid sequences. The present invention thus provides isolated nucleic 
acid molecules having a polynucleotide sequence encoding any of the Hu-Asp polypeptides of 
the invention, wherein said polynucleoudc sequence encodes a Hu-Asp polypeptide having 
the complete amino acid sequence of SEQ ID N0:2. SEQ ID NO:4, SEQ ID N0:6, or 
fragments thereof. 
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Abo provided herein are purified Hu-Asp polypeptides, both recombinant and non- 
rccombinini. Most importantly, methods to produce Hu-Asp2 polypeptides in active form arc 
provided. These include production of Hu-Asp2 polypeptides and variants thereof in bacterial 
cells, insect cells, and mammalian cells, also in forms that allow secretion of the HiJ-Asp2 
5 polypeptkfc from bacterial, insect or mammalian cells into the culture medium, also methods 
to produce variants of Hu-Asp2 polypeptide incorporating amino acid tags that facilitate 
subsequent purification. In a preferred embodiment of the invention the Hu-Asp2 polypeptide 
is converted to a proteolyiically active form either in transfonncd cells or after purification 
and cleavage by a second protease in a ceU-ftee system, such active forms of the Hu-Asp2 

10 polypeptide beginning with the N-terminal sequence TQHGIR or ETDEEP. Variants and 
derivatives, including fragments, of Hu-Asp proteins having the native amino acid sequences 
given in SEQ ID Nos: 2, 4, and 6 that retain any of the biological activities of Hu-Asp arc also 
within the scope of the present invention. Of course, one of ordinaiy skill in the art will 
readily be able to determine whether a variant, derivative, or fragment of a Hu-Asp protein 

15 displays Hu-Asp activity by subjecdng the variant, derivative, or fragment to a standard 
aspartyl protease assay. Fragments of Hu-Asp within the scope of this invention include those 
that contain the active site domain containing the amino acid sequence DTC. fragments that 
contain the active site domain anuno acid sequence DSG, fragments containing both the DTG 
and DSG active site sequences, fragments in which the spacing of the DTG and DSG active 

20 site sequences has been lengthened* fragments in which the spacing has been shortened. Also 
within the scope of the invention arc fragments of Hu-Asp in which the transmembrane 
domain has been removed to allow production of Hu-Asp2 in a soluble form. In another 
embodinwnt of the invention, the two halves of Hu-Asp2, each containing a single active site 
DTG or VSC sequence can be produced independently as recombinant polypeptides, then 

25 combined in solution where they reconstitute an active protease. 

Hu-Asp variants may be obtained by mutation of native Hu-Asp-cncoding nucleotide 
sequences, for example. A Hu-Asp variant, as referred to herein, is a polypeptide 
substantially homologous to a native Hu-Asp polypeptide but which has an amino acid 
sequence different from that of native Hu-Asp because of one or more deletions, insertions, or 

30 substitutions in the amino acid sequence. The variant amino acid or nucleotide sequence is 
preferably at least about 80% identical, more preferably at least about 90% identical, and most 
preferably at least about 95% identical, to a native Hu-Asp sequence. Thus, a variant 
nucleotide sequence which contains, for example, S point mutations for every one hundred 
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nucleotides, as compared to a native Hu-Asp gene, will be 95% identical to the native protein. 
The percentage of sequence identity, also termed homology, between a native and a variant 
Hu-Asp sequence may also be determined, for example, by comparing the two sequences 
using any of- the computer programs commonly employed for this purpose, such as the Gap 
piogiam (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer 
Group, University Research Park, Madison Wisconsin), which uses the algorithm of Smith 
and Waterman (Adv. Appl Math. 2: 482-489 (1981)). 

Alterations of the native amino acid sequence may be accomplished by any of a 
number of known techniques. For example, mutations may be introduced at particular 
locations by procedures well known to the skilled artisaii, such as oligonucieotidc-directed 
mutagenesis, which is described by Walder et ai {Gene 42:13^ (1986)); Bauer et aL {Gene 
37:13 (1985)); Oaik (BioTechniques, January 1985. pp. 12-19); Smith e: ai (Genetic 
Engineering: Principles and Methods, Plenum Press (1981)); and U.S. Patent Nos. 4,518,584 
and 4.737.462. 

Hu-Asp variants within the scope of the invention may comprise conservatively 
substituted sequences, meaning that one or more amino acid residues of a Hu-Asp p<rfypeptide 
are replaced by different residues that do not alter the secondary and/or tertiary structure of the 
Hu-Asp polypeptide. Such substimtions may include the replacement of an amino acid by a 
residue having similar physicochemical properties, such as substituting one aliphatic residue 
(He, Val, Leu or Ala) for another, or substimtion between basic residues Lys and Arg, acidic 
residues Glu and Asp, amide residues Gin and Asn, hydroxyl residues Ser and Tyr, or 
aromatic residues Phe and Tyr. Further information regarding making phenotypically silent 
amino acid exchanges may be found in Bowie et al.. Science 247:1306-1310 (1990). Other 
Hu-Asp variants which might retain substantially the biological activities of Hu-Asp are those 
where amino acid subsiiiutioos have been made in areas outside functional regions of the 
protein. 

In another aspect, the invention provides an isolated nucleic acid molecule comprising 
a polynucleotide which hybridizes under stringent conditions to a porrion of the nucleic acid 
molecules described above, e.g.. to at least about 15 nucleotides, preferably to at least about 
20 nucleotides, more preferably to at least about 30 nucleotides, and still more preferably to at 
least about from 30 to at least about 100 nucleotides, of one of the previously described 
nucleic acid molecules. Such portions of nucleic acid molecules having the described lengths 
refer to. e.g,, at least about 15 contiguous nucleotides of the reference nucleic acid molecule. 
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By stringent hybridization conditions is intended overnight incubation at about Al^C for about 
2.5 bouis in 6 X SSC70.1% SDS, followed by washing of the filters in LO X SSC at 65*C 
0.1% SDS. 

Fragments of the Hu -Asp-encoding nucleic acid molecules described herein, as well as 
5 polynucleotides capable of hybridizing to such nucleic acid molecules may be used as a probe 
or as primers in a polymerase chain reaction (PGR). Such probes may be used, e.g., to detect 
the presence of Hu-Asp niKleic acids in in vitro assays, as well as in Souttttm and northern 
blots. Cell types expressing Hu-Asp may also be identified by the use of such probes. Such 
procedures are well known, and the skilled artisan will be able to choose a probe of a length 

10 suitable to the particular application. For PGR, 5' and 3' primers corresponding to the termini 
of a desired Hu-Asp nucleic acid molecule are en^loyed to isolate and amplify that sequence 
using conventional techniques. 

Other useful fragments of the Hu-Asp nucleic acid molecules are antisense or sense 
oligonucleotides comprising a single-stranded nucleic acid sequence capable of binding to a 

15 target Hu-Asp mRNA (using a sense strand), or Hu-Asp DNA (using an antisense strand) 
sequence. In a prefeired embodimeni of the invention these Hu-Asp antisense 
oligonucleotides reduce Hu-Asp mRNA and consequent production of Hu-Asp polypeptides. 

In another aspect, the invention includes Hu-Asp polypeptides with or without 
associated native pattern glycosylaiioa Both Hu-Aspl and Hu-Asp2 have canonical acceptor 

20 sites for Asn- linked sugars, with Hu-Aspl having two of such sites, and Hu-A$p2 having four. 
Hu-Asp expressed in yeast or mammalian expression systems (discussed below) may be 
similar to or significantly different from a native Hu-Asp polypeptide in molecular weight 
and glycosylation pattern. Expression of Hu-Asp in bacterial expression systems will provide 
non-glycosylaied Hu-Asp. 

25 The polypeptides of the present invention are prefei^iy provided in an isolated form, 

and preferably arc substanlially purified. Hu-Asp polypeptides may be recovered and purified 
from tissues, cultured cells, or recombinant cell cultures by well-known methods, including 
aiiunonium sulfate or cthanol precipitation, anion or cation exchange chromatography, 
phosphocellulosc chromatography, hydrophobic interaction chromatography, affinity 

30 chromatography, hydroxylapattte chromatography, lectin chromatography, and high 
performance liquid chromatography (HPLC). In a preferred embodiment, an amino acid tag is 
added to the Hu-Asp polypeptide using genetic engineering techniques that arc well known to 
practioners of the art which include addition of six histidine amino acid residues to allow 
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purification by binding lo nickel immobilized on a suitable support epitqws for polyclonal or 
monoclonal antibodies including but not limited to the T7 epitope, the myc epitope, and the 
V5a epitope, and fusion of Hu-Asp2 lo suitable protein partners including but not limited to 
glutathionc-S-transferase or maltose binding protien. In a prefcired embodiment these 
additional amino acid sequences are added to the C-tcrminus of Hu-Asp but may be added to 
the N-tenninus or at intervening positions within the Hu-Asp2 polypeptide. 

The present invention also relates to vectors comprising the polynucleotide molecules 
of the invention, as well as host cell transformed with such vectors. Any of the polynuclecKide 
molecules of the invention may be joined to a vector, which generally includes a selectable 
marker and an origin of replication, for propagation in a host. Because the invention also 
provides Hu-Asp polypeptides expressed from die polynucleotide molecules described above, 
vectors for the expression of Hu-Asp ore preferred. The vectors include DNA encoding any of 
the Hu-Asp polypeptides described above or below, operably linked to suitable transcriptional 
or iranslational regutaiOTy sequences, such as those derived from a mammalian, microbial, 
viral, or insect gene. Examples of regulatory sequences include transcriptional promoters, 
operators, or enhancers, mRNA ribosomal binding sites, and appropriate sequences which 
control transcription and translation. Nucleotide sequences are operably linked when the 
regulatory sequence functionally relates lo the DNA encoding Hu-Asp. Thus, a promoter 
nucleotide sequence is operably linked to a Hu-Asp DNA sequence if the promoter nucleotide 
sequence directs tiie transcription of the Hu-Asp sequence. 

Selection of suitable vectors lo be used for die cloning of polynucleotide molecules 
encoding Hu-Asp, or for the expression of Hu-Asp polypeptides, will of course depend upon 
the host cell in which the vector will be transformed, and. where applicable, the host cell from 
which the Hu-Asp polypeptide is to be expressed. Suitable host cells for expression of Hu- 
Asp polypeptides include prokaiyotes, yeast, and higher cukaryotic cells, each of which is 
discussed below. 

The Hu-Asp polypeptides to be expressed in such host cells may also be fusion 
proteins which include regions from heterologous proteins. Such regions may be included to 
allow, e,g,, secretion, improved stability, or facilitated purification of the polypeptide. For 
example, a sequence encoding an appropriate signal peptide can be incorporated into 
expression vectors. A DNA sequence for a signal peptide (secretory leader) may be fused 
in-frame to the Hu-A^ sequence so tiiat Hu-Asp is translated as a fusion protein comprising 
the signal peptide. A signal peptide that is functional in the intended host cell promotes 
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extracellular secretion of the Hu-Asp polypeptide. Preferably, tbe signal sequence will be 
cleaved from the Hu-Asp polypeptide upon secretion of Hu-Asp from the cell. Non-limiting 
examples of signal sequences that can be used in practicing the invention include the yeast 
I-factor and the honeybee melatin leader in sf9 insect cells. 

In a preferred embodiment, the Hu-Asp polypeptide will be a fusion protein which 
includes a heterologous region used to facilitate purification of the polypeptide. Many of the 
available peptides used for such a function allow selective binding of the fusion protein to a 
binding paitaer. For example, the Hu-Asp polypeptide may be modified to comprise a 
peptide to form a fusion protein which specifically binds to a binding partner, or peptide tag. 
Non-iimiting examples of such peptide tags include the 6-His tag, thioredoxin tag, 
hemaglutinin tag, GST tag. and OmpA signal sequence tag. As will be understood by one of 
skill in the art, the binding partner which recognizes and binds to the peptide may be any 
molecule or compound including metal ions (e.g.. metal affmity columns), antibodies, or 
fragments thereof, and any protein or peptide which binds the peptide, such as the FLAG tag. 

Suitable host cells for expression of Hu-Asp polypeptides includes prokaryotes, yeast, 
and higher eulcaiyotic cells. Suitable prokaryotic hosts to be used for the expression of Hu- 
Asp include bacteria of the genera Escherichia^ Bacillus, and Saimonella, as well as members 
of the geneia Pseudomonas. Streptomyces, and Staphylococcus. For expression in, e.g., E. 
colt, a Hu-Asp polypeptide may include an N-tcrminal methionine residue to facilitate 
expression of the recon:U:>iiiant polypeptide in a prokaryotic host, "nie N-tenninal Met may 
optionally then be cleaved from the expressed Hu-Asp polypeptide. Other N-terminal amino 
acid residues can be added to the Hu-Asp polyi»ptide to facilitate expression in Escherichia 
colt including but not limited to the T7 leader sequence, the T7-caspase 8 leader sequence, as 
well as others leaders including tags for purification such as the 6-His tag (Example 9). Hu- 
Asp polypeptides expressed in E. coli may be shortened by removal of the cytoplasmic tail, 
tbe transmembrane domain, or the membrane proximal region. Hu-Asp polypeptides 
expressed in E, coli may be obtained in either a soluble forai or as an insoluble form which 
may or may not be present as an inclusion body. Tbe insoluble polypeptide may be rendered 
soluble by guanidine HCl, uiea or other protein deoaturants, then refolded into a soluble form 
before or after purification by dilution or dialysis into a suitable aqueous buffer. If the 
inactive proform of the Hu-Asp was produced using recombinant methods, it nuy be rendered 
active by cleaving off the prosegment with a second suitable protease such as human 
immunodeficiency virus protease. 

25 



wo 00/17369 PCTAJS99/20881 

Expression vectors for use in prokaryotic hosts generally comprises one or more 
phenotypic selectabie marker genes. Such genes generally encode, e.g., a protein that confers 
antibiotic resistance or thai supplies an auxotrophic requirement. A wide variety of such 
vectors arc-readily available from commercial sources. Examples include pSPORT vectors, 
pGEM vectors (Promega), pPROEX vectors (LTI, Betbesda, MD). Biuescripi vectors 
(Straiagene), pET vectors (Novagen) and pQE vectors (Qiagen). 

Hu-Asp may also be expressed in yeast host cells from genera including 
Saccharomyces, Pichia, and Kiuveromyces. Preferred yeast hosts arc 5. cerevisiae and P. 
pastoris. Yeast vectors will often contain an origin of replication sequence from a XT yeast 
piasmid, an autonomously replicating sequence (ARS), a promoter region, sequences for 
polyadcnylation, sequences for transcription termination, and a selectable marker gene. 
Vectors replicable in both yeast and £ coli (termed shuttle vectors) may also be used, in 
addition to the above-mentioned features of yeast vectors, a shuttle vector will also include 
sequences for replication and selection in £ coli. Direct secretion of Hu-Asp polypeptides 
expressed in yeast hosts may be accomplished by the inclusion of nucleotide sequence 
encoding the yeast T-facior leader sequence at the 5* end of the Hu- Asp-encoding nucleotide 
sequence. 

Insect host cell culture systems may also be used for the expression of Hu-Asp 
polypeptides. In a preferred cmbodimcot, the Hu-Asp polypeptides of the invention arc 
expressed using an insect cell expression system (see Example 10). Additionally, a 
baculovinis expression system can be used for expression in insect cells as reviewed by 
Luckow and Summers, Bio/Technolagy 6:47 (1988). 

In another preferred embodiment, the Hu-Asp polypeptide is expressed in mammalian 
host cells. Non-iimiting examples of suitable mammalian cell lines include the COS-7 line of 
monkey kidney cells (Gluzman et ai. Cell 23:175 (1981)), human embyonic kidney cell line 
293. and Chinese hamster ovary (CHO) cells. Preferably, Chinese hamster ovary (CHO) cells 
are used for expression of Hu-Asp proteins (Example 1 1). 

The choice of a suitable expression vector for expression of the Hu-Asp polypeptides 
of the invention will of course depend upon the specific manunalian host cell to be used, and 
is within the skill of the ordinary artisan. Examples of suitable expression vectors include 
pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). A preferred vector for expression of 
Hu-Asp polypeptides is pcDNA3.1-Hygro (Inviurogen). Expression vectors for use in 
mammalian host cells may include transcriptional and translational control sequences derived 

26 



wo 00/17369 PCT/US99/20881 

from viral genomes. Commonly used promoter sequences and enhancer sequences which 
^ may be used in the present invention include, but are not limited to, those derived from human 

cytomegalovinis (CMV). Adenovirus 2. Polyoma virus, and Simian virus 40 (SV40). 

Methods for^hc consuiiction of manmialian expression vectors are disclosed, for example, in 
iQ- 5 Okayama and Berg {Mol. Cell Biol J:280 (1983)); Cosman et al {Mol Immunol 25:935 

(1986)); Cosman et al. {Naxure 312:76S (1984)); EP-A-0367566; and WO 91/18982. 

The polypeptides of the present invention may also be used to raise polyclonal and 

monoclonal antibodies, which are useful in diagnostic assays for detecting Hu-Asp 

polypeptide expression. Such antibodies may be prepared by conventional techniques. See, 
10 for example. Antibodies: A Laboratory Manual, Harlow and Land (eds.). Cold Spring Harbor 

Laboratory Press, Cold Spring Harbor, N.Y., (1988); Monoclonal Antibodies, Hybridomas: A 
20 New Dimension in Biological Analyses, Kennet et al. (eds.). Plenum Press, New York (1980). 

Synthetic peptides comprising portions of Hu-Asp containing 5 to 20 amino acids may also be 

used for the protkiction of polyclonal or monoclonal antibodies after linkage to a suitable 
!5 carrier protein including but not limited to keyhole limpet hcmacyanin (KLH), chicken 

25 

ovalbumin, or bovine serum albumin using various cross-Unking reagents including 
caibodimides, gtutaraldehyde, or if the peptide contains a cysteine, N-metbylmaleimide. A 
preferred peptide for immunization when conjugated to KLH contains the C-terminus of 
20 Hu.AspI or Hu-Asp2 comprising QRRPRDPEVVNDESSLVRHRWK or 

20 LRQQHDDFADDISUJC respectively. 

The Hu-Asp nucleic acid molecules of the present invention are also valuable for 
chromosome identification, as they can hybridize with a specific location on a human 
chromosome. Hu-Asp 1 has been localized to chromosome 21. while Hu-Asp2 has been 
localized to chromosome I Iq233-24.I. There is a current need for identifying particular sites 

25 on the chromosome, as few chromosome marking reagents based on actual sequence data 
(repeat polymorphisms) are presently available for maricing chromosomal location. Once a 
sequence has been mapped to a precise chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic map data. The relationship 
between genes and diseases that have been mapped to the same chromosomal region can then 

45 

30 be identified through linkage analysis, wherein the coinheritance of physically adjacent genes 
is determined. Whether a gene appearing to be related to a particular disease is in fact the 
cause of the disease can then be determined by comparing the nucleic acid sequence between 
so affected and unaffected individuals. 

27 
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In another enrfxxliment, the invention relMcs to a method of assaying Hu-Asp function, 
specifically Hu-Asp2 function which involves incubating in solution the Hu-Asp polypeptide 
with a suitable substrate including but not limited to a synthetic peptide containing the ^ 
secTctase cleavage site of APP, preferably one containing the mutation found in a Swedish 
kindred with inherited AD in which KM is changed to NL, such peptide comprising the 
sequence SEVNLDAEFR in an acidic buffering solution, preferably an acidic buffering 
solution of pK5.5 (see Exan^Ie 12) using cleavage of the peptide monitored by high 
performance liquid chromatography as a measure of Hu-Asp proteolytic activity. Preferred 
assays for proteolytic activity utilize internally quenched peptide assay substrates. Such 
suitable substrates include peptides which have attached a paired flurophore and quencher 
including but not limited to coumarin and dinicrophenol. respectively, such that cleavage of 
the peptide by the Hu-Asp results in increased fluorescence due to physical separation of the 
flurophore and quencher. Preferred colorimetric assays of Hu-Asp proteolytic activity utilize 
other suitable substrates that include the P2 and PI amino acids comprising the recognition 
site for cleavage linked to o-nitrophenol through an amide linkage, such that cleavage by the 
Hu-Asp results in an increase in optical density after altering the assay buffer to alkaline pH. 

In another embodiment, the invention relates to a method for the identification of an 
agent that increases the activity of a Hu-Asp polypeptide selected from the group consisting of 
Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b), the method comprising 

(a) determining the activity of said Hu-Asp polypeptide in the presence of a test 
agent and in the absence of a test agent; and 

(b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide 
determined in the absence of said test agent; 

whereby a higher level of activity in the presence of said test agent than in the absence of said 
test agent indicates that said test agent has increased the activity of said Hu-Asp polypeptide. 
Such tests can be performed with Hu-Asp polypeptide in a cell free system and with cultured 
cells that express Hu-Asp as well as variants or isofoims thereof. 

In another embodiment, the invention relaes to a method for the identification of an 
agent that decreases the activity of a Hu-Asp polypeptide selected from die group consisting 
of Hu-Asp 1, Hu-Asp2(a), and Hu-Asp2(b), the method comprising 
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(a) deiemuning the activity of said Hu-Asp polypeptide in the presence of a test 
agent and in the absence of a test agent: and 

(b) comparing the activity of said Hu-Asp polypeptide detcnnincd in the 
. presence of said test agent to the activity of said Hu-Asp polypeptide 

> detennined in the absence of said test agent; 

whereby a lower level of activity in the presence of said test agent than in the absence of said 
test agent indicates that said test agent has decreased the activity of said Hu-Asp polypeptide. 
Such tests can be performed with Hu-Asp polypeptide in a cell tee, system and with cultured 
cells that express Hu-Asp as well as variants or isofonns thereof. 

) In another embodiment, the invention relates to a novel cell line (HEK125.3 cells) 

for measuring processing of amyloid ^ peptide (AP) from the amyloid protein precursor 
(APP). The cells are stable transfoimants of human embryonic kidney 293 cells (HEK293) 
with a bicistronic vector derived from pIRES-EGFP (Clontech) containing a nwdificd 
human APP cDNA, an internal ribosome entry site and an enhanced green fluorescent 
protein (EGFP) cDN A in the second cistron. The APP cDN A was modified by adding two 
lysine codons to the carboxyl terminus of the APP coding sequence. This increases 
processing of Ap peptide from human APP by 2-4 fold. This level of Ap peptide 
processing is 60 fold higher than is seen in nontiansformed HEK293 cells. HEKI25.3 cells 
will be useful for assays of compounds that inhibit Ap peptide processing. This invention 
also includes addition of two lysine residues to the C-lcrminus of other APP isofonns 
including the 751 and 770 amino acid isoforms, to isofotms of APP having mutations found 
in human AD including the Swedish KM-^NL and V717— >F mutations, to C-terminal 
fragments of APP, such as those beginning with the p-secretase cleavage site, to C-terminal 
fragments of APP containing the p-secrctase cleavage site which have been operably linked 
to an N-teiminal signal peptide for membrane insertion and secretion, and to C-terminal 
fragments of APP which have been operably linked to an N-terminat signal peptide for 
membrane insertion and secretion and a reporter sequence including but not limited to 
green fluorescent protein or alkaline phosphatase, such that p-secretasc cleavage releases 
the reporter protein from the surface of cells expressing the polypeptide. 

Having generally described the invention, the same will be more readily 
understood by reference to the following examples, which are provided by way of illustration 
and are not intended as limiting. 
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EXAMPLES 

Example 1: Development of a Search Algorithm Useful for the Identificadon of 
Aspartyl Proteases^ and Identification of C. elegans Aspartyl Protease 
. . Genes in Wormpep 22: 

Materials and Methods: 

Classical aspartyl proteases such as pepsin and renin possess a two-domain structure 
which folds to bring two aspartyl residues into proximity within the active site. These are 
embedded in the short tripeptide motif DTG, or more rarely, DSG. The DTG or DSG 
active site motif appears at ^ui residue 25-30 in the enzyme, but at about 65-70 in the 
proenzyme (prorenin, pepsinogen). This motif appears again about 150-200 residues 
downstream. The proenzyme is activated by cleavage of the N-tcrminal prodomain. This 
pattern exemplifies the double domain structure of the modem day aspartyl enzymes which 
apparently arose by gene duplication and divergence. Thus; 

NH: X D^G Y D^*^G C 

where X denotes the beginning of the enzyme, following the N-tcrminal prodomain, and Y 
denotes the center of the molecule where the gene repeat begins again. 

In the case of the retroviral enzymes such as the HIV protease, they represent only a 
half of the two-domain stnicmres of well-known enzymes like pepsin, cathepsin D, renin, 
etc. They have no prosegment. but are carved out of a polyprotcin precursor containing the 
gag and pol proteins of the vims. They can be represented by: 

NH2 D"TG ClOO 

This "monomer" only has about 100 aa, so is extremely parsimonious as compared to the 
other aspartyl protease "dimers" which have of the order of 330 or so aa, not counting the 
N-tenninal prodomain. 

The limited length of the cukaryotic aspartyl protease active site motif makes it 
difficult to search EST collections for novel sequences. EST sequences typically average 
250 nucleotides, and so in this case would be unlikely to span both aspartyl protease active 
site motifs. Instead, we tmned to the C. elegans genome. The C. elegans genome is 
estimated to contain around 13,000 genes. Of these, roughly 12,000 have been sequenced 
and the corresponding hypothetical open reading frame (ORF) has been placed in the 
database Woiinpepl2. We used this database as the basis for a whole genome scan of a 
higher eukaryote for novel aspanyl proteases, using an algorithm thai we developed 
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Specifically for this puipose. The following AWK script for locating proteins containing two 
^ DTG or DSG motifs was used for the search, which was repeated four times to recover all 

pairwise combinations of the aspanyi motif. 

BEGIN {RS=">") /* defines ">'* as record separator for FAST A format */ 

5 { 

'0 pos = index(SO "DTG") /'finds "DTG" in record*/ 

if (pos>0) { 

lesi = substr($0,pos+3) /'get rest of record after first DTG*/ 
pos2 = iadcx(rcst."DTG") /'find second DTG*/ 
10 if (pos2>0) prinif ("%s%s\n",">",SO) } /*report hits*/ 

15 ] 
) 

The AWK script shown above was used to search Wormpepl2, which was 
downloaded ^m ftp.sanger.ac.uk/pub/databases/wormpep. for sequence entries containing 
. 15 at least two DTG or DSG motifs. Using AWK limited each record to 3{X)0 characters or 
less. Thus, 35 or so larger records were diminated manually from 
Wormpepl2 as in any case these were unlikely to encode aspartyl proteases. 

25 

Results and Discussion: 

20 The Wormpcp 12 database contains 12,178 entries, although some of these (<10%) 

represent alternatively spliced transcripts from the same gene. Estimates of the number of 
genes encoded in the C. elegans genome is on the order of 13,000 genes, so Wonnpepl2 
may be estimated to cover greater than 90% of the C. elegans genome. 

Eukaryotic aspaityl proteases contain a two-domain structure, probably arising from 

25 ancestral gene duplication. Each domain contains the active site motif D(S/T)G located 
from 20-25 amino acid residues into each domain. The retroviral (e.g., HIV protease) or 
rctrotransposon proteases are homodimers of subunits which are homologous to a single 
eukaryotic aspanyi protease domain. An AWK script was used to search the Wormpcp 1 2 
database for proteins in which the D(S/T)G motif occurred at least twice. This identified 

30 >60 proteins with two DTG or DSG motifs. Visual inspection was used to select proteins 
in which the position of the aspartyl domains was suggestive of a two-domain strucmre 
meeting the criteria described above. 

45 

In addition, the PROSITE eukaryotic and viral aspartyl protease active site patlcm 
PS00141 was used to search Wormpepl2 for candidate aspartyl proteases. (Bairoch A.. 
35 Bucher P., Hofmann K-, The PROStTE database: its status in 1997, Nucleic Acids Res. 
^ 24:217-221(1997)). This generated an overlapping set of Wormpepl2 sequences. Of these, 
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seven sequences contained two DTG or DSG motifs and the PROSriE aspaityl protease 
active site pattern. Of these seven, three were found in the same cosmid clone (F21F8.3. 
F21F8.4, and F21F8.7) suggesting that they represent a family of proteins that arose by 
ancestral gene duplication. Two other ORFs with extensive homology to F2 1 F8.3 . F2 1 F8.4 
5 and F21F8.7 are present in the same gene cluster {F21F8.2 and F21F8.6), however, these 
contain only a single DTG motif. Exhaustive BLAST searches with these seven sequences 
against WormpepI2 failed to reveal additional candidate aspartyl proteases in the C 
elegans genome containing two repeats of the DTG or DSG motif. 

BLASTX search with each C eUgans sequence against SWISS-PROT, GcnPep and 
10 TREMBL revealed that R 12H7.2 was the closest worm homologuc to the known 

mammalian aspanyl proteases, and that T18H9.2 was somewhat more distantly related, 
while CEASPl. F21F8.3, F2IF8.4. and F21F8.7 formed a subcluster which had the least 
sequence homology to the manunallan sequences. 
Discussion: 

IS APP. the presenilins, and p35, the activator of cdtc5. all undergo intracellular 

proteolytic processing at sites which conform to the substrate specificity of the HIV 
protease. Dysregulation of a cellular aspartyl protease with the same substrate specificity, 
might therefore provide a unifying mechanism for causation of the plaque and tangle 
pathologies in AD. Therefore, we sought to identify novel human aspartyl proteases. A 

20 whole genome scan in C elegans identified seven open reading frames that adhere to the 
aspartyl protease profile that we had identified. These seven aspartyl proteases probably 
comprise the complete complement of such proteases in a simple, multicellular eukaryote. 
These include four closely related aspartyl proteases unique to C. elegans which probably 
arose by duplication of an ancestral gene. The other three candidate aspaityl proteases 

25 (T18H9.2, R12H7.2 and CI ID2.2) were found to have homology to mammalian gene 
sequences. 
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Example Z: Identification of Novel Human Aspartyl Proteases Using Database 
Mining by Genome Bridging 

Materials and Methods: 

Cotrtputer-assisted analysis of EST databases, cDNA , and predicted polypeptide 
sequences: 

Exhaustive homology searches of EST daiabases with the CEASPl, F21F8.3, 
F21F8.4, and F21F8.7 sequences failed to reveal any novel mammalian homotogues. 
TBLASTN searches with R12H7.2 showed homology to cathepsin D, cathcpsin E, 
pepsinogen A, pepsinogen C and renin, particulariy around the DTG motif within the active 
site, but also failed to identify any additional novel mammalian aspartyl proteases. This 
indicates that the C. elegans genome probably contains only a single lysosomal aspartyl 
protease which in mammals is represented by a gene family that arose through duplication 
and consequent modification of an ancestral gene. 

TBLASTN searches with T18H9.2, the remaining C. elegans sequence, identified 
several ESTs which assembled into a contig encoding a novel human aspartyl protease (Hu- 
ASPl). As is described above in Example 1. BLASTX search with the Hu-ASPl contig 
against SWISS-PROT revealed that the active site motifs in the sequence aligned with the 
active sites of other aspartyl proteases. Exhaustive, repetitive rounds of BLASTN searches 
against LtfeScq. LifeSeqFL, and the public EST collections identified 102 EST from 
multiple cDN A libraries that assembled into a single contig. The 5 1 sequences in this 
contig found in public EST collections also have been assembled into a single contig 
(THC2 13329) by The Institute for Genome Research (TIGR). The TIGR annotation 
indicates that they failed to Hnd any hits in the database for the contig. Note that die TIGR 
contig is the reverse complement of the LifeSeq contig tiial we assembled. BLASTN search 
of Hu-ASPl against the rat and mouse EST sequences in ZooSeq revealed one homologous 
EST in each database (Incyte clone 70031 1523 and IMAGE clone 313341, GcnBank 
accession number W 10530, respectively). 

TBLASTN searches with the assembled DNA sequence for Hu-ASP 1 against both 
LifeSeqFL and the public EST databases identified a second, related human sequence (Hu- 
Asp2) represented by a single EST (2696295). Translation of this partial cDNA sequence 
reveals a single DTG motif which has homology to the active site motif of a bovine aspartyl 
protease, NMl. 
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BLAST searches, conlig assemblies and multiple sequence alignments were 
perfonncd using the bioinformatics tools provided with the LifeSeq, LifeSeqFL and 
LifcSeq Assembled databases from Incyic. Predicted protein moti/s were identified using 
either the PpoStte dictionary (Motifs in GCG 9) or the Pfam database. 
Full-length cDNA cloning of Hu-Aspl 

The open reading irame of C elegans gene T18H9.2CE was used to query incyie 
LifcSeq and LifcScq-FL databases and a single electronic assembly referred to as 
1863920CE1 was detected. The 5' most cDNA clone in this contig. 1863920, was obtained 
from Incyte and completely sequenced on both strands. Translation of the open reading 
frame contained within clone 1863920 revealed the presence of the duplicated aspartyl 
protease active site motif (DTG/DSG) but the 5' end was inconipleie. The remainder of the 
Hu-Aspl coding sequence was determined by 5" Marathon RACE analysis using a human 
placenta Marathon ready cDNA template (Clonctcch). A 3*-antisense oligonucleotide 
primer specific for the 5' end of clone 1863920 was paired with the 5'-scnse primer specific 
for the Marathon ready cDNA synthetic adaptor in the PGR. Specific PGR products were 
directly sequenced by cycle sequencing and the resulting sequence assembled with the 
sequence of clone 1 863920 to yield the complete coding sequence of Hu-Asp- 1 (SEQ ID 
No. 1). 

Several interesting features are present in the primary amino acid sequence 
of Hu-Aspl (Figure 1, SEQ ID No. 2). The sequence contains a signal peptide (residues 1- 
20 in SEQ ID No. 2), a pro-segment, and a catalytic domain containing two copies of the 
aspartyl protease active site motif (DTG/DSG). The spacing between the first and second 
active site motifs is about 200 residues which should correspond to the expected size of a 
single, eukaryotic aspartyl protease domain. More interestingly, the sequence contains a 
predicted transmembrane domain (residues 469-492 in SEQ ID No.2) near its C-terminus 
which suggests that the protease is anchored In the membrane. This feamre is not found in 
any other aspanyl protease. 
Cloning of a full-length Hu-Asp-2 cDNAs: 

As is described above in Example 1, genome wide scan of the Caenorhabditis 
elegans "database WoimPepl2 for putative aspartyl proteases and subsequent mining of 
human EST databases revealed a human ortholog to the C. elegans gene T18H9.2 referred 
to as Hu-Aspl. The assembled coniig for Hu-Aspl was used to query for human paralogs 
using the BLAST search tool in human EST databases and a single significant match 
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(2696295CE1) with approximately 60% shared identity was found in the lifeSeq FL 
database. Similar queries of either gblOSPubEST or the family of human databases 
available from TIGR did not identify similar EST clones. cDNA clone 2696295. identified 
by single pass sequence analysis from a human uterus cDNA library, was obtained from 
5 Incyte and completely sequence on both strands. This clone contained an incomplete 1266 
bp open-reading frame that encoded a 422 amino acid polypeptide but lacked an initiator 
ATG on the 5' end. Inspection of the predicted sequence revealed the presence of the 
duplicated aspartyl protease active site motif DTG/DSG, separated by 194 amino acid 
residues. Subsequent queries of later releases of the LifeSeq EST database identified an 

10 additional ESTs, sequenced from a human astrocyte cDNA library (4386993), that appeared 
to contain additional 5* sequence relative to clone 2696295. Clone 4386993 was obtained 
from Incyte and completely sequenced on both strands. Comparative analysis of clone 
4386993 and clone 2696295 confirmed that clone 4386993 extended the open-reading 
frame by 31 amino acid residues including two in-frame translation initiation codons. 

15 E>espite the presence of the two in-frame ATGs. no in-frame stop codon was observed 
upsuream of the ATG indicating that the 4386993 may not be full-length. Fuithennorc, 
alignment of the sequences of clones 2696295 and 4386993 revealed a 75 base pair 
insertion in clone 2696295 relative to clone 4386993 that results in the insertion of 25 
additional amino acid residues in 2696295. The remainder of the Hu-A5p2 coding sequence 

20 was determined by 5* Marathon RACE analysis using a human hippocampus Marathon 
ready cDNA template (Clonetech). A 3'-antisense oligonucleotide primer specific for the 
shared 5'-region of clones 2696295 and 4386993 was paired with the 5'-sense primer 
specific for the Marathon ready cDN A synthetic adaptor in the PGR. Specific PGR 
products were directly sequenced by cycle sequencing and the resulting sequence assembled 

25 with the sequence of clones 2696295 and 4386993 to yield the complete coding sequence 
of Hu-Asp2(a) (SEQ ID No. 3) and Hu-Asp2(b) (SEQ ID No. 5), respectively. 

Several interesting feamres are present in the primary amino acid sequence of Hu- 
Asp2(a) (Figure 2 and SEQ ID No. 4) and Hu-Asp-2(b) (Figure 3, SEQ ID No. 6). Both 
sequences contain a signal peptide (residues 1-21 in SEQ ID No. 4 and SEQ ID No. 6), a 

30 pro-segment, and a catalytic domain containing two copies of the aspartyl protease active 
site motif (DTG/DSG). The spacing between the first and second active site motifs is 
variable due to the 25 amino acid residue deletion in Hu-Asp-2(b) and consists of 168- 
versus- 194 amino acid residues, for Hu-Asp2{b) and Hu-Asp-2(a). respectively. More 
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interestingly, both sequences contains a predicted transmembrane domain (residues 455-477 
in SEQ ID No.4 and 430-452 in SEQ ID No. 6) near their C-termini which indicates that the 
protease is anchored in the membrane. This feature is not found in any other aspanyl 
protease except Hu- Asp 1 . 

5 Example 3. Molecular cloning of mouse Asp2 cDNA and genomic DNA. 

Cloning and characterization of murine Asp2 cDNA— The murine ortholog of Hu_Asp2 

was cloned using a combination of cDN A library screening, PCR, and genomic cloning. 

AppToximaieiy 500,000 independent clones from a mouse brain cDNA library were 

screened using a '^P-Iabelcd coding sequence probe prepared from Hu_Asp2. Replicate 

10 positives were subjected to DNA sequence analysis and the longest cDNA contained the 
entire 3' untranslated region and 47 amino acids in the coding region. PCR amplification 
of the same mouse brain cDNA library with an antisense oligonucleotide primer specific for 
the 5 '-most cDNA sequence determined above and a sense primer specific for the 5' region 
of human Asp2 sequence followed by DNA sequence analysis gave an additional 980 bp of 

15 the coding sequence. The remainder of the 5* sequence of murine Asp-2 was derived from 
genomic sequence (sec below). 

Isolation and sequence analysis of the murine Asp-l gene — ^A murine EST sequence 
encoding a portion of the murine Asp2 cDNA was identified in the GenBank EST database 
using the BLAST search tool and the Hu-Asp2 coding sequence as the query. Gone 

20 g3 160898 displayed 88% shared identity to the human sequence over 352 bp. 

Oligonucleotide primer pairs specific for this region of murine Asp2 were then synthesized 
and used to amplify regions of the murine gene. Murine genomic DNA, derived from strain 
129/SvJ, was amplified in the PCR (25 cycles) using various primer sets specific for murine 
Asp2 and the products analyzed by agarose gel electrophoresis. The primer set Zoo-1 and 

25 Zoo-4 amplified a 750 bp fragment that contained approximately 600 bp of intron sequence 
based on comparison to the known cDNA sequence. This primer set was then used to 
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screen a murine B AC library by PCR, a single genomic clone was isolated and this cloned 
was confirmed coniain the murine Asp2 gene by DNA sequence analysis. Shotgun DN A 
sequencing of this Asp2 genomic clone and comparison lo the cDN A sequences of both 
Hu_Asp2 and the partial murine cDNA sequences defined the fiill-length sequence of 
murine Asp2 (SEQ ID No. 7). The predicted amino acid sequence of murine Asp2 (SEQ ID 
No. 8) showed 96.4% shared identity (GCG BestFit algorithm) with 18/501 amino acid 
residue substitutions compared to the human sequence (Figure 4). 

Example 4: Tissue Distribution of Expression oJHu-Aspl Transcripts: 
Materials and Methods: 

The tissue distribution of expression of Hu- Asp-2 was determined using multiple 
tissue Northem blots obtained from Clonetech (Palo Alto, CA). Ircytc clone 2696295 in 
the vector pINCY was digested to completion with EcoRUNotl and the 1.8 kb cDNA insert 
purified by preparative agarose gel electrophoresis. This fragment was radiolabeled to a 
specific activity > 1 X lO' dprn/^g by random priming in the presence of [a-'^P-dATP] 
(>3000 Ci/mmol, Amersham, Arlington Heights, IL) and Klenow fragment of DNA 
polymerase L Nylon filters containing denatured, size fractionated poly A* RNAs isolated 
from different human tissues were hybridized with 2x10* dpm/ml probe in ExpressHyb 
buffer (Clonetech, Palo Alto, CA) for 1 hour at 68 "^C and washed as recomiiwndcd by the 
manufacture. Hybridization signals were visualized by autoradiography using BioMax XR 
film (Kodak, Rochester, NY) with intensifying screens at -80 "C. 
Results and Discussion: 

Limited information on the tissue distribution of expression of Hu-Asp-2 transcripts 
was obtained from database analysis due to the relatively small number of ESTs detected 
using the methods described above (< 5). In an effort to gain further information on the 
expression of the Hu-Asp2 gene, Northern analysis wa.s employed to determine both the 
size(s) and abundance of Hu-Asp2 transcripts. PolyA*^ RNAs isolated from a series of 
peripheral tissues and brain regions were displayed on a solid support following separation 
under denaturing conditions and Hu-Asp2 transcripts were visualized by high stringency 
hybridization to radiolabeled insert from clone 2696295. The 2696295 cDNA probe 
visualized a constellation of transcripts that migrated with apparent sizes of 3.0kb, 4.4 kb 
and 8.0 kb with the latter two transcript being the most abundant. 
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Across the tissues surveyed, Hu-Asp2 transcripts were most abundant in pancreas 
and brain with lower but detectable levels observed in all other tissues examined except 
thymus and PBLs. Given the relative abundance of Hu-Asp2 transcripts in brain, the 
regional expression in brain regions was also established. A similar constellation of 
transcript sizes were detected in all brain regions examined [cerebellum, cerebral cortex, 
occipital pole, frontal lobe, temporal lobe and putamcnl with the highest abundance In the 
medulla and spinal cord. 

Example 5: Northern Blot Detection of HuAsp-1 and HuAsp-2 Transcripts in 
Human Ceil Lines: 

A variety of human cell lines were tested for their ability to produce Hu-Aspl and 

Asp2 mRNA. Human embryonic kidney (HEK-293) cells, African green monkey (Cos-7) 
cells, Chinese hamster ovary (CHO) cells, HELA cells, and the neuroblastoma cell line 
IMR-32 were all obtained from the ATCC. Cells were cultured in DME containing 10% 
FCS except CHO cells which were maintained in a-MEM/10% FCS at 37 **C in 5% CO2 
until they were near confluence. Washed monolayers of cells (3 X 10^) were lyscd on the 
dishes and poly A* RNA extracted using the Qiagen Oligotex Direct mRNA kit. Samples 
containing 2 ^g of poly A* RNA from each cell line were fractionated under denaturing 
conditions (glyoxal-treaied), transferred to a solid nylon membrane support by capillary 
action, and transcripts visualized by hybridization with random-primed labeled (^^P) coding 
sequence probes derived from cither Hu-Aspl or Hu-Asp2. Radioactive signals were 
detected by exposure to X-ray film and by image analysis with a Phosphorlraager. 

The Hu-Aspl cDNA probe visualized a similar constellation of transcripts (2.6 U> 
and 3.5 kb) that were previously detected is human tissues. The relative abundance 
determined by quantification of the radioactive signal was Cos-7 > HEK 292 = HELA > 
IMR32. 

The Hu-Asp2 cDNA probe also visualized a similar constellation of transcripts 

compared to tissue (3.0 kb, 4.4 kb. and 8.0 kb) with the following relative abundance; HEK 

293 > Cos 7 > IMR32 > HELA. 

Exaoqjle 6: Modification of AFP to increase Ap processing for in vitro 
screening 

Human cell lines that process Ap peptide from APP provide a means to screen in 
cellular assays for inhibitors of (J- and y-secretase. Production and release of Ap peptide 
into the culture supernatant is monitored by an enzyme-linked immunosorbent assay (EIA). 
Although expression of APP is widespread and both neural and non-neuronal cell lines 
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process and release AP peptide, levels of endogenous APP processing are low and difFicult 
to detect by ElA. Ap processing can be increased by expressing in transformed cell lines 
mutations of APP thai enhance Ap processing. Wc made the serendipitous observation that 
addition of two lysine residues to the carboxyl terminus of APP695 increases AP processing 
10 5 still fiiither. This allowed us to create a transformed cell line that releases AP peptide into 

the culture medium at the remarkable level of 20,000 pg/ml. 
Materials And Methods 
Matericds: 

15 

Human embryonic kidney cell line 293 (HEK293 cells) were obtained internally. 

10 The vector pIRES-EGFP was purchased from Clontech. Oligonucleotides for mutation 

using the polymerase chain reaction (PCR) were purchased from Genosys. A plasmid 

20 containing human APP695 (SEQ ID No. 9 [nucleotide] and SEQ ID No. 10 {amino acid]) 

was obtained from Northwestern University Medical School. This was subcloned into pSK 

(Stratagcne) at the Not I site creating the plasmid pAPP695. 

15 Mutagenesis protocol: 

The Swedish mutation (K670N. M671L) was introduced into pAPP695 using the 

Straiagenc Quick Change Mutagenesis Kit to create the plasmid pAPP695NL (SEQ ID No. 

1 1 [nucleotide] and SEQ ID No. 12 [amino acid]). To introduce a di-lysine motif at the C- 

terminus of APP695 , the forward primer #276 5 * GACTGACCACTCGACC AGGTTC 

20 (SEQ ID No. 47) was used with the "patch" primer #274 5' 

CGAATTAAATTCCAGCACACTGGCTACrrCTTGTTCTGCATCTCAAAGAAC(SEQ 
ID No. 48) and the flanking primer #275 CGAATTAAATTCCAGCACACTGGCTA (SEQ 
ID No. 49) to modify the 3' end of the APP695 cDNA (SEQ ID No. 15 [nucleotide] and 
SEQ ID No. 16 [amino acid]). This also added a BstXl restriction site that will be 

25 compatible with the BstX 1 site in the multiple cloning site of pIRES-EGFP. PCR 

amplification was performed with a Cloniech HF Advantage cDNA PCR kit using the 
polymerase mix and buffers supplied by the manufacturer. For "patch" PCR, the patch 
primer was used at l/20th the molar concentratioti of the flanking primers. PCR 
amplificalion products were piirified using a QlAquick PCR purification kit (Qiagcn). 

30 After digestion with restriction enrymes, products were separated on 0.8% agarose gels and 
then excised DNA fragments were purified using a QlAquick gel extraction kit (Qiagen). 
To reassemble a modified APP695-Sw cDNA, the 5' Notl-Bgl2 fragment of the 
^ APP695-SW cDNA and the 3' Bgl2-BstXl APP695 cDNA fragment obtained by PCR were 
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Ugatcd into pIRES-EGFP plasmid DNA opened ai the Not 1 and BstXl sites. Ligations 
were perfonned for 5 minutes at room temperature using a Rapid DNA Ligation kit 
(Boehringer Mannheim) and transformed into Library Efficiency DH5a Competent Cells 
(GibcoBRL-Lifc Technologies). Bacterial colonics were screened for inserts by PCR 
amplification using primers #276 and #275. Plasmid DNA was purified for mammalian 
cell transfection using a QIAprep Spin Miniprcp kit (Qiagen). The construct obtained was 
designated pMG1253 (APPSW-KK, SEQ ID No. 17 [nucleotide) and SEQ ID No. 18 
[amino acid]). 

Mammalian Cell Transfection: 

HEK293 cells for transfection were grown to 80% confluence in Dulbccco's 
modified Eagle's medium (DMEM) with \0% fetal bovine serum. Cotransfections were 
performed using UpofectAmine (Gibco-BRL) with 3 jig pMG 125.3 DNA and 9 ^.g 
pcDNA3. 1 DNA per 10 x 10* cells. Three days posttransfcciion, cells were passaged into 
medium containing G418 at a concentration of 400 ^tg/ml. After three days growth in 
selective medium, cells were sorted by their fluorescence. 
Clonal SeUction ofJ25.3 cells by FACS: 

Cell samples were analyzed on an EPICS Elite ESP flow cytomctcr (Coulter, 
Hialeah, FL) equipped with a 488 nm excitation line supplied by an air-cooled argon laser. 
EGFP emission was measured through a 525 nm band-pass filter and fluorescence intensity 
was displayed on a 4-decade log scale after gating on viable cells as determined by forward 
and right angle light scatter. Single green cells were separated into each well of one 96 well 
plate containing growth medium without G4 1 8. After a four day recovery period, G41 8 
was added to the medium to a final concentration of 400 [ig/ml. After selection, 32% of the 
wells contained expanding clones. Wells with clones were expanded from the 96 well plate 
to a 24 welt plate and then a 6 well plate with the fastest growing colonies chosen for 
expansion at each passage. The fmal cell line selected was the fastest growing of the final 
six passaged. This clone, designated 125.3, has been maintained in G4 1 8 at 400 ug/ml with 
passage every four days into fresh medium. No loss of AP production of EGFP 
fluorescence has been seen over 23 passages. 

A/3 ElA Analysis (Double Antibody Sandwich EUSA for hAfi 1-40/42): 

Cell culture supematants harvested 48 hr after transfection were analyzed in a 
standard A^ ElA as follows. Human A^ 1-40 or M2 was measured using monoclonal 
antibody (mAb) 6E10 (Senetek, St. Louis, MO) and biotinylated rabbit antisenmi 162 or 
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164 (New York State Institute for Basic Research, Staten Island, NY) in a double antibody 
sandwich EUSA. The capture antibody 6E 10 is specific to an epitope present on the N- 
teiminal amino acid residues 1-16 of hAp. The conjugated detecting antibodies 162 and 
164 are specific for hAp 1-40 and 1-42, respectively. Briefly, a Nunc Maxisorp 96 well 
immunoplate was coated with 100 ^il/well of mAb 6E10 (5|ig/ml) diluted in 0.1 M 
carbonate-bicarbonate buffer, pH 9.6 and incubated at 4''C overnight. After washing the 
plate 3x with 0.0 IM DPBS (Modified Dulbccco's Phosphate Buffered Saline (0.008M 
sodium phosphate, 0.002M potassium phosphate, 0. 14M sodium chloride, 0.01 M 
potassium chloride, pH 7.4) from Pierce. Rockford, D) containing 0.05% of Tween-20 
(DPBST), the plate was blocked for 60 min with 200^1 of 10% norma] sheep serum 
(Sigma) in 0.01 M DPBS to avoid non-specific binding. Human Ap MO or 1-42 standards 
lCK)pJ/weU (Bachem, Tonance, CA) diluted, from a Img/ml stock solution in DMSO. in 
culture medium was added after washing the plate, as well as lOO^J/well of sample, 
e.g.coDditioned medium of transfectcd cells. The plate was incubated for 2 hours at room 
temperamre and 4**C overnight. The next day, after washing the plate, 100>U/well 
biotinylated rabbit antiserum 1 62 1 :400 or 164 1:50 diluted in DPBST + 0.5% BSA was 
added and incubated at room temperature for Ihr 15 min. Following washes, 100(jJ/weU 
neutravidin-horscradish peroxidase (Pierce. Rockford, II) diluted 1: 10,000 in DPBST was 
applied and incubated for 1 hr at room temperature. After the last washes lOOpl/well of o- 
phenylnediamine dihydrochloride (Sigma Qiemicals, St. Louis, MO) in 50mM citric 
acid/lOOmM sodium phosphate buffer (Sigma Chemicals, St. Loui.s, MO), pH 5.0. was 
added as substrate and the color development was monitored at 450nm in a kinetic 
microplaie reader for 20 min. using Soft max Pro software. All standards and samples were 
run in triplicates. The samples with absorbance values falling within the standard curve 
were extrapolated from the standard curves using Soft max Pro software and expressed in 
pg/ml culnire medium. 
Results: 

Addition of two lysine residues to the carboxyl terminus of APP695 greatly 
increases Ap processing in HEK293 cells as shown by transient expression (Table 1). 
Addition of the di-Iysine motif to APP695 increases Ap processing to thai seen with the 
APP695 containing the Swedish mutation. Combining the di- lysine motif with the Swedish 
mutation further increases processing by an additional 2.8 fold. 
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Cotransformaiion of HEK293 cells with pMG 125.3 and pcDNA3.1 allowed dual 

selection of transformed cells for G4 18 resistance and hi^ level expression of EGFP. 

After clonal selection by FACS, the cell line obtained, produces a remarkable 20,000 pg Ap 

peptide per ml of culture medium after growth for 36 hr in 24 well plates. Production of 

AP peptide under various growth conditions is summarized in Table 2. 

TABLE 1 . Release of Ap peptide, into the culture medium 48 hr after transient 
transfcction of HEK293 cells with the indicated vectors containing wildtype or modified 
APP. Values tabulated are mean + SD and P-value for pairwise comparison using Student's 
t-tcst assuming unequal variances. 



APP Construct 


Ap 1-40 peptide 
(PB/ml) 


Fold Increase 


P-valuc 


pIRES-EGFP vector 


147+28 


1.0 




wtAPP695( 142.3) 


194+15 


13 


0.051 


wtAPP695-KK (124.1) 


424 + 34 


2.8 


3 X 10-5 


APP695-SW (143.3) 


457 + 65 


3.1 


2x 10-3 


APP695-SwKK (125.3) 


1308 + 98 


8.9 


3x 10^ 



TABLE 2. Release of Ap peptide from HEK 1 25.3 cells under various growth 
conditions. 



Type of Culmre 


Volume of 


Duration of 


Ab 1-40 


Ab 1-42 


Plate 


Medium 


Culture 


(pg/ml) 


(Pfi/ml) 


24 well plate 


400 ul 


36 hr 


28,036 


1,439 



Example 7: Antisense oligomer inhibidon ofAbeta processing in HEKI25.3 cells 
The sequences of Hu-Aspl and Hu-Asp2 were provided to Sequitur. Inc (Nadclt, 

MA) for selection of targeted sequences and design of 2nd generation chimeric antisense 

oligomers using prorictary technology (Sequitur Vcr. D Pat pending #3002). Antisense 

oligomers Lot# S644, S645, S646 and S647 were targeted against Aspl. Antisense 

oligomers Lot# S648, S649, S650 and S651 were targeted against Asp2. Control antisense 

oligomers Lot# S652, S653, 5655, and S674 were targeted against an irrelevant gene and 

antisense oligomers Lot #S656. S657, S658, and S659 were targeted against a second 

irrelevant gene. 

For iransfection with the antisense oligomers, HEK125.3 cells were grown to about 
50% confluence in 6 well plates in Minimal Essential Medium (MEM) supplemented with 
10% fetal calf serum. A stock solution of oligofectin G (Sequitur Inc., Natick, MA) at 2 
mg/ml was diluted to 50 (ig/ml in serum free MEM. Separately, the antisense oligomer 
stock solution at 100 ^M was diluted to 800 nM in Opti-M£M (GIBCO-BRL. Grand 
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Island, NY). The diluted stocks of oUgofcciin G and aniiscosc oligomer were then niixed at 
a ratio of 1:1 and incubated at room temperature. After 15 mio incubation, the reagent was 
diluted 10 fold into MEM containing 10% fetal calf seium and 2 ml was added to each well 
of the 6 well plate after first removing the old medium. After transfection, cells were 
grown in the continual presence of the oligofectin G/antisense oligomer. To monitor Ab 
peptide release, 400 fil of conditioned medium was removed {xriodically from the culture 
well and replaced with ftesh medium beginning 24 hr after transfection. Data reported are 
from culture supematants harvested 48 hr after transfection. 
Results: 

The 16 different antisense oligomers obtained from Sequitur Inc were transfected 
separately into HEK125.3 cells to determine their affect on Ap peptide processing. Only 
antisense oligomers targeted against Aspl & Asp2 reduced Abeta processing by HEK125.3 
cells with those targeted against Asp2 having a greater inhibitory effect. Both Ap (1-40) 
and A^ (M2) were inhibited by the same degree. In Table 3, percent inhibition is 
calculated with respect to untransfected cells. Antisense oligomer reagents giving greater 
than 50% inhibition are marked with an asterisk. Of the reagents tested, 3 of 4 antisense 
oligomers targeted against ASPl gave an average 52% inhibition of Ap 1-40 processing 
and 47% inhibition of Ap 1-42 processing. For ASP2, 4 of 4 antisense oligomers gave 
greater than 50% inhibition with an average inhibition of 62% for Ap 1^ processing and 
60% for Ap 1-42 processing. 

Table 3. Inhibition of Ap peptide release from HEKI25.3 cells treated with antisense 
oligomers. 



Gene Targeted 


Antisense Oligomer 


Abeta (1^) 


Abeta (1-42) 


Aspl-l 


S644 


62%* 


56%* 


Asp 1-2 


S645 


41%* 


38%* 


Asp 1-3 


S646 


52%* 


46%* 


AspM 


S647 


6% 


25% 


Asp2-1 


S648 


71%* 


67%* 


Asp2-2 


S649 


83%* 


76%* 


Asp2-3 


S650 


46%* 


50%* 


Asp2-4 


S651 


47%* 


46%* 


Conl-1 


S652 


13% 


18% 


Con 1-2 


S653 


359J> 


30% 


Conl-3 


S655 


9% 


18% 


Con 1-4 


S674 


29% 


18% 


Con2-l 


S656 


12% 


18% 


Con2-2 


S657 


16% 


19% 


Con2-3 


S658 


8% 


35% 
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C0112-4 S659 3% 18% 

5 



10' 



15 



20 



25 



30 



35 



40 



45 



44 



wo 00n7369 PCT/US99/20881 
Example 8. Demonstration of Hu-Asp2 p- Secretase Activity in Cultured Cells 

Several mytattons in APP associated with early onset Alzheimer's disease have been 
shown to alter A^ peptide processing. These flank the N- and C-teiminal cleavage sites 
that release AD from APP. These cleavage sites are referred to as the P-secrctase and f- 
5 secretase cleavage sites, respectively. Cleavage of APP at the p-sccrctase site creates a C- 
tenninal fiagment of APP containing '99 amino acids of 1 1,145 daltons molecular weight. 
The Swedish KM->NL mutation immediately upstream of the ^-secretase cleavage site 
causes a ^eral increase in production of both the 1 -40 and 1 -42 amino acid forms of AO 
peptide. The London VF mutation (V717->F in the APP770 isoform) has little effect on 

10 total AD peptide production, but appears to preferentially increase the percentage of the 
longer 1-42 amino acid form of AD peptide by affecting the choice of Y-sccretasc cleavage 
site used during APP processing. Thus, we sought to determine if these mutations altered 
the amount and type of AG peptide produced by cultured cells cotransfected with a 
construct directing expression of Hu-Asp2. 

15 Two experiments were perfomr^ which demonstrate Hu-Asp2 p-secictasc 

activity in culmicd cells. In the fu^t experiment, treatment of HEICI253 cells with antisense 
oligomers directed against Hu-Asp2 transcripts as described in Example 7 was found to 
decrease the amount of the C-terminal fragment of APP created by ^secretase cleavage 
(CTF99) (Figure 9). This shows that Hu-Asp2 acts dirccdy or indirectly to facilitate p- 

20 secretase cleavage. In the second experimeni, increased expression of Hu-Asp2 in 
transfected mouse Neuro2A cells is shown to increase accumui^on of the CrF99 p- 
secretase cleavage &agment (Figure 10). This increase is seen most easily when a mutant 
APP-KK clone containing a C-terminal di-iysine motif is used for transfcction. A further 
increase is seen when Hu-A5p2 is cotransfected with APP-Sw-KK containing the Swedish 

25 mutation KM -^NL. The Swedish mutton is known to increase cleavage of APP by the p- 
secretase. 



wo 00/17369 PCT/US99/2088I 

A second set of experiments demonstraie Hu-Asp2 facilitates •y-sccrciasc activity in 
coiransfection experimenis with human embryonic Jcidney HEK293 cells. Cotransfection of 
Hu-Asp2 with an APP-KK clone greatly increases production and release of soluble Apl-40 
and ApM2 peptides from HEK293 cells. There is a proportionately greater increase in the 
5 release of Api-42. A further increase in production of APM2 is seen when Hu-Asp2 is 
cotransfectcd with APP-VF (SEQ ID No. 13 [nucleotide] and SEQ ID No. 14 (amino acid]) 
or APP-VF-KK SEQ ID No. 19 [nucleotide] and SEQ ID No. 20 [amino acid]) clones 
containing the London mutation V717-»F. The V717^F mutation is known to alter 
cleavage specificity of the APP y-secretase such that the preference for cleavage at the Ap42 
10 site is increased. Thus, Asp2 acts directly or iudireclly to facihute y-secretase processing of 
APP at the p42 cleavage site. 

Materials 

Antibodies 6E10 and 4G8 were purchased from Scncick (Sl Louis, MO). Antibody 369 
was obtained from the laboratory of Paul Greengard at the Rockefeller University. 
15 Antibody C8 was obtained from the laboratory of Dennis Selkoe at the Harvard Medical 
School and Brigham and Women's Hospital. 
APP Constructs used 

The APP constructs used for transfection experiments comprised the following 

20 APP wild-type APP695 (SEQ ID No. 9 and No. 10) 

APP-Sw APP695 containing the Swedish KM-+NL mutation (SEQ ID No. 1 1 

and No. 12). 

APP-VF APP695 containing the London V->F mutation (SEQ ID No. 1 3 and 

No. 14) 

25 APP-KK APP695 containing a C-tcrminal KK motif (SEQ ID No. 1 5 and No. 

16). 

APP-Sw-KK APP695-S w containing a C-tcrminal KK motif (SEQ ID No. 1 7 and 
No. 18), 

APP-VF-KK APP695-VF containing a C-terminal KK motif (SEQ ID No. 19 and 
30 No. 20). 

These were inserted into the vector pIRES-EGFP (Clontech, Palo Alto CA) between the 

Notl and BstXl sites using appropriate linker sequences introduced by PGR. 

Transfection ofantisense oligomers orplasmid DNA constructs in HEK293 cells, 
35 HEK12SJ cells and Neuro-2A cells. 
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Human embryonic kidney HEK293 cells and mouse Neiiro-2a cells were transfcctcd with 
expression constructs using the Lipofectamine Plus reagent from Glbco/BRU Cells were 
seeded in 24 well tissue culture plates to a density of 70-80% confluence. Four wcils per 
plate were transfcctcd with 2 jig DNA (3: 1, APPrcotransfectant). 8|iJ Plus reagent, and 4(iJ 
Lipofectamine in OptiMEM. OptiMEM was added to a total volume of I ml, distributed 
200 \ii per well and incubated 3 hours. Care was taken to hold constant the ratios of the two 
plasmids used for cotransfection as well as the total amount of DNA used in the 
transfection. The iransfection media was replaced with DMEM, 10%FBS. NaPynivatc, 
with antibiotic/antimycotic and the cells were incubated under normal conditions (37", 5% 
CO2) for 48 hours. The conditioned media were removed to polypropylene tubes and 
stored at -80**C until assayed for the content of Ap 1-40 and Ap 1-42 by EIA as described in 
the preceding examples. Transfection of antisense oligomers into HEKI25.3 cells was as 
described in Example 7. 

Preparation of cell extracts, Western blot protocol 

Cells were harvested after being transfected with plasmid DNA for about 60 hours. 
First, cells were transferred to 15-ml conical tube from the plate and centrifuged at 1,500 
rpm for 5 min to remove the medium. The cell pellets were washed with PBS fOT one time. 
We then lysed the cells with lysis buffer (10 mM HEPES, pH 7.9. 150 mM NaQ, 10% 
glycerol, 1 mM EGTA, 1 mM EDTA, 0. 1 mM sodium vanadate and 1% NP-40). The lysed 
cell mixtures were centrifuged at 5000 rpm and the supernatant was stored at -20**C as the 
cell extracts. Equal amounts of extracts from HEK125.3 cells transfected with the Asp2 
antisense oligomers and controls were precipitated with antibody 369 that recognizes the C- 
terminus of APP and then CTF99 was detected in the immunoprecipitate with antibody 
6E10. The experiment was repeated using C8. a second precipitating antibody that also 
recognizes the C-tenninus of APP. For Western blot of extracts from mou.se Neuro-2a cells 
cotransfectcd with Hu-Asp2 and APP-KK, APP-Sw-KK, APP-VF-KK or APP-VF, equal 
amounts of cell extracts were electrophoresed through 4-10% or 10-20% Tricine gradient 
gels (NOVEX, San Diego, CA). Full length APP and the CTF99 P-secrctase product were 
detected with antibody 6E10. 

Results 



47 



wo 00/17369 PCT/US99/208SI 
Transfeciion of HEK12S.3 cells with Asp2-I or Asp2-2 antiscnse oligomers rcducti 

5 

production of the CTF p-secretasc product in comparison to cells similarly transfccted with 
control oligomers having the reverse sequence (Asp2-1 reverse & Asp2-2 reverse) 
^Q- In cotransfection experiments, cotransfection of Hu-Asp2 into mouse Neuro-2a cells with 

5 the APP-KK construct increased the formation of CTF99. This was further increased if Hu- 
Asp2 was coexprcssed with APP-Sw-KK, a mutant form of APP containing the Swedish 

15 

KM— >NL mutation that increases ^-secretase processing. 

Cotransfection of Hu-Asp2 with APP has little effect on A^ production but 
increases A^2 production above background (Table 4). Addition of the di-lysine motif to 
to the C-ienninus of APP increases A|} peptide processing about two fold, although A^40 and 
Ap42 production remain quite low (352 pg/ml and 21 pg/ml, respectively). Cotransfection 

25 

of Asp2 with APP-KK further increases both Ap40 and Ap42 production. The stimulation 
of AP40 production by Hu-A5p2 is more that 3 fold, while production of A]342 increases by 
more than 10 fold. Thus, cotransfection of Hu-Asp2 and APP-KK constructs preferentially 

20 

IS increases A^2 production. 

The APP V717— »F mutation has been shown to increase y-secretasc processing at the 
Ap42 cleavage site. Cotransfection of Hu- Asp2 with the APP-VF or APP-VF-KK 
constructs increased Ap42 production ( a two fold increase with APP-VF and a four-fold 
. increase with APP-VF-KK, Table 4), but had mixed effects on Ap40 production (a slight 

40 

20 decrease with APP-VF, and a two fold increase with APP-VF-KK in comparison to the 
pcDNA cotransfection control. Thus, the effect of Asp2 on Ap42 production was 
proportionately greater leading to an increase in the ratio of Ap42/total Ap. Indeed, the ratio 
of Ap42/total Ap reaches a very high value of 429E> in HEK293 cells cotransfcctcd with Hu- 
Asp2 and APP-VF-KK. 

50 
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Western blot showing reduction of CTF99 production by HEK.125.3 celis transfcctcd with 
antisense oligomers targeting the Hu-Asp2 mRNA. (right) Western blot showing increase in 
CTF99 production in mouse Neuro-2a cells cotransfecied with Hu-Asp2 and APP-KK. A 
further increase in CTF99 production is seen in cells cotransfected with Hu- Asp2 and APP- 
Sw-KK. 

Table 4. Results of cotransfecting Hu-Asp2 or pcDNA plasmtd DN A with various APP 
constructs containing the V7I7— mutation that modifies "f-secretase processing. 
Cotransfcction with Asp2 consistently increases the ratio of Ap42/total Ap. Values 
tabulated are Ap peptide pg/ml. 







pcDNA 
Cotranitfection 




Asp2 

Cotransfcction 








A^ 


Ap42 


Ap42/T0iat 




AP42 


A^2/Tot4l 


APP 


192+18 


<4 


<2% 


188±40 


8+10 


3.9* 


APP-VF 


118+15 


15+19 




85+7 


24+12 


22.4% 


APP-KK 


352+24 


21+6 


53% 


1062+101 


226+49 


17.5% 


APP-VF-KK 


230+31 


88+24 


27.7% 


49l±35 


355+36 


42% 



Example 9. Bacterial expression of human Asp2L 
Expression of recombinant Ru_Asp2L in E. coli. 

Hu-Asp2L can be expressed in E. coli after addition of N-terminal sequences such as a T7 
tag (SEQ ID No. 21 and No. 22) or a T7 tag followed by a caspasc 8 leader sequence (SEQ 
n> No. 23 and No. 24). Alternatively, reduction of the GC content of the 5' sequence by site 
directed mutagenesis can be used to increase the yield of Hu- Asp2 (SEQ ID No. 25 and No. 
26). In addition, Asp2 can be engineered with a proteolytic cleavage site (SEQ ID No. 27 
and No. 28). To produce a soluble protein after expression and refolding, deletion of the 
transmembrane domain and cytoplasmic tail, or deletion of the membrane proximal region, 
transmembrane domain, and cytoplasmic tail is preferred. 
Methods 
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PCR with primers containing appropriate linker sequences was used to assemble fusions of 
Asp2 coding sequence with N-terminal sequence modificaiions including a T7 tag (SEQ ID 
Nos. 21 and 22) or a T7-caspase 8 leader (SEQ ID Nos. 23 and 24). These constructs were 
cloned into the expression vector pci23a(+) [Novagen] in which a T7 promoter directs 
expression of a T7 tag preceding a sequence of multiple cloning sites. To clone Hu-Asp2 
sequences behind the T7 leader of pet23a+, the following oligonucleotides were used for 
amplification of the selected Hu-Asp2 sequence: 

#553=GTGGATCCACCCAGCACGGCATCCGGCTG (SEQ ID No. 35), 
#554=GAAAGCnTCATGACTCATCTGTCTGTGGAATG7TG (SEQ ID No. 36) which 
placed BamHI and Hindm sites flanking the S' and 3' ends of the insert, respectively. The 
Asp2 sequence was amplified from the full length Asp2(b) cDNA cloned into pcDNA3.1 
using the Advantage-GC cDNA PCR [Qontech] following the manufacturer's supplied 
protocol using annealing & extension at eS^C in a two-step PCR cycle for 25 cycles. The 
insert and vector were cut with BamHI and Hindm. puriried by electrophoresis through an 
agarose gel, then ligaied using the Rapid DNA Ligation kit [Bocrhinger Mannheim). The 
ligation reaction was used to transform the E. coli strain JM109 (Promega) and colonies 
were picked for the purification of plasmid (Qiagen.CJiaprep minispin) and DNA sequence 
analysis . For inducible expression using induction with isopropyl l>-D- 
thiogaiaciopyranosidfi (IPTG), the expression vector was transferred into E. coli su^n 
BL21 (Siatagene). Bacterial cultures were grown in LB broth in the presence of ampicillin 
at 100 ug/ml, and induced in log phase growth at an OD600 of 0.6- LO with I mM IPTG for 
4 hour al 37°C. The cell pellet was harvested by centrifugation. 

To clone Hu-Asp2 sequences behind the T7 tag and caspase leader (SEQ ID Nos. 23 
and 24), the construct created above containing the T7-Hu-Asp2 sequence (SEQ ID Nos. 21 
and 22) was opened at the BamH 1 site, and then the phosphoiylatcd caspase 8 leader 
oligonucleotides #559=GATCGATGACTATCrCTGACTCTCCGCGTGAACAGGACG 
(SEQ ID No. 37). #560=OATCCGTCCTGTTCACGCGGAGAGTCAGAGAT AGTCATC 
(SEQ ID No. 38) were annealed and ligaied to the vector DNA. The 5' overhang for each set 
of oligonucleotides was designed such that it allowed ligation into the BamHI site but not 
subsequent digestion with BamHI. The ligation reaction was transformed into JM 109 as 
above for analysis of protein expression after transfer to E. coli strain BL21. 
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Id order to reduce the GC content of the 5' terminus of asp2, a pair of antiporaliel oligos 
were designed to change degenerate codon bases in 15 amino acid positions from G/C to 
A/T (SEQ ID Nos. 25 and 26). The new nucleotide sequence at the 5' end of asp2 did not 
change the encoded amino acid and was chosen to optimize E. Coli expression. The 
sequence of the sense linker is 5' 

CGGCATCCGGCTGCCCCTGCGTAGCGGTCrGGGTGGTGCrCCACTGGGTCTGCG 
TCTGCCCCGGGAGACCGACGAA G 3' (SEQ ID No. 39). The sequence of the antisense 
linker is : 5* 

CTTCGTCGGTCTCCCGGGGCAGACGCAGACCCAGTGGAGCACCACCCAGACCG 
CTACGCAGGGGCAGCCGG ATGCCG 3 ' (SEQ ID No. 40). After annealing the 
phosphoryiatcd linkers together in 0.1 M NaCl-10 mM Tris, pH 7.4 they were ligated into 
unique Oa I and Sma I sites in Hu-Asp2 in the vector pTAC. For inducible expression 
using induction with isopropyl b-D-thiogalactopyranoside (IPTG), bacterial cultures were 
grown in LB broth in the presence of ampiciltin at 100 ug/ml. and induced in log phase 
growth at an OD600 of 0.6-1 .0 with 1 mM BPTG for 4 hour at 3TC. The cell pellet was 
harvested by centriftigation. 

To create a vector in which the leader sequences can be removed by limited 
proteolysis with caspase 8 such that this liberates a Hu-Asp2 polypeptide beginning with 
the N-ierminal sequence GSFV (SEQ ID Nos. 27 and 28), the following procedure was 
followed. Two phosphorylated oligonucleotides containing the caspase 8 cleavage site 
ETD. #571=5- 

GATCGATGACTATCTCTGACTCTCCGCTGGACTCTGGTATCGAAACCGACG 
(SEQ ID No. 4 1 ) and #572= 

GATCCGTCGGTTTCGATACCAGAGTCCAGCGGAGAGTCAGAGATAGTCATC 
(SEQ ID No. 42) were annealed and ligated into pET23a+ tliat had been opened with 
BamHL After transformation into JM109, the purified vector DNA was recovered and 
orientation of the insert was confirmed by DNA sequence analysis. +. the following 
oligonucleotides were used for amplification of the selected Hu-Asp2 sequence: 
#573=5'AAGGATCCnTGTGGAGATGGTGGACAACCTG. (SEQ ID No. 43) 
#554=GAAAGCnTCATGACrCATCTGTCTGTGGAATGTTG (SEQ ID No. 44) which 
placed BamHI and HindQI sites flanking the 5' and 3' ends of the insert, respectively. The 
Asp2 sequence was amplified from the full length Asp2 cDNA cloned into pcDNA3. 1 
using the Advantage-GC cDNA PGR [Qontech] following the manufacturer's supplied 
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protocol using annealing & extension at 68 °C in a two-step PCR cycle for 25 cycles. The 
insert and wctor were cut with BamKI and Hindin, purified by electrophoresis through an 
agarose gel. then ligated using the Rapid DNA Ligation kit [Bocrhinger Mannheim]. The 
ligation reaction was used to transform the E. coli strain JMI09 [Promega] and colonies 
5 were picked for the purification of plasmid (Qiagen.Qiaprcp minispin) and DNA sequence 
analysis . For inducible expression using induction with isopropyl b-D- 
thiogalactopyranoside (IPTG), the expression vector was transferred into E. coli strain 
BL21 (Statagenc). Bacterial cultures were grown in LB broth in the presence of ampiciUin 
at 100 ug/ml, and induced in log phase growth at an OD600 of 0.6-1.0 with 1 mM IPTG for 

10 4 hour at 37°C. The cell pellet was harvested by ccntrifugation. 

To assist purification, a 6- His tag can be introduced into any of the above constructs 
following the T7 leader by opening the construct at the BamHI site and then ligating in the 
annealed, phosphorylated oligonucleotides containing the six histidine sequence 
#565=GATCGCATCATCACCATCACCATG (SEQ ID No. 45), 

15 #566=GATCCATGGTGATGGTGATGATGC (SEQ ID No. 46). The 5' overhang for each 
set of oligonucleotides was designed such that it allowed ligation into the BaniHI site but 
not subsequent digestion with BamHL 
Preparation of Bacterial Pellet: 

36.34g of bacterial pellet representing 1D.8L of growth was dispersed into a total 
20 volume of 200ml using a 20mm tissue homogenizer probe at 3000 to 5000 rpm in 2M KCl, 
O.IM Tris, 0.05M EDTA, ImM DTT. The conductivity adjusted to about 193mMhos with 
water. 

After the pellet was dispened, an additional amount of the KCl solution was added, 

bringing the total volume to 500 ml. This suspension was homogenized fiirther for about 3 

25 minutes at 5000 rpm using the same probe. The mixture was then passed through a Rannie 

high-pressure homogenizer at 10,000psi. 

In all cases, the pellet material was carried forward, while the soluble fraction was 
discarded.Tbe resultant solution was centrifuged in a GS A rotor for Ihr. at 12,500 rpm. The 
peliel was resuspended in the same solution (without the DTT) using the same tissue 
30 homogenizer probe at 2,000 rpm. After homogenizing for 5 minutes at 3000 rpm, the 
volume was adjusted to 500ml with the same solution, and spun for Ihr. at 12,500 rpm. 
The pellet was then resuspended as before, but this time the final volume was adjusted to 
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1-5L with the same solution prior to homogenizing for 5 minutes. After cenirifuging at the 
same speed for 30 minutes, this procedure was repeated. The pellet was then rcsuspcnded 
into about ISOml of cold water, pooling the pellets from the six centrifuge tubes used in the 
GSA rotor- The pellet has homogenized for 5 minutes at 3,000 rpm, volume adjusted to 
2S0ml with cold water, then spun for 30 minutes. Weight of the resultant pellet was 
17.75g. 

Summary: Lysis of bacterial pellet in KCi solution, followed by ccnirifugation in a 
GSA rotor was used to initially prepare the pellet. The same solution was then used an 
additional three times for resuspension/homogenization. A final water 
wash/homogenization was then performed to remove excess KCI and EDTA. 
Solublization of rHuAsp2L: 

A ratio of 9-lOml/gram of pellet was utilized for solubilizing the rHuAsp2L from the pellet 
previously described. 17.75g of pellet was thawed, and ISOml of 8M guanidine HCl, 5mM 
PME. 0.1% DEA, was added. 3M Tris was used to titrate the pH to 8.6. Tlie pellet was 
initially restispended into the guanidine solution using a 20nim tissue homogenizer probe at 
1000 rpm. The mixture was then stirred at 4*'C for 1 hour prior to ccntriftig^ion ai 
12,500ipm for 1 hour in GSA rotor. The resultant supernatant was then centrifuged for 
30min at 40,000 x g in an SS-34 rotor. The final supernatant was then stored at -20°C, 
except for 50ml. 

Immobilized Nickel Affinity Chromatography of Solubilized rHuAsp2L: 
The following solutions were utilized: 

A) 6M Guanidine HCl, 0. IM NaP, pH 8.0, 0.OIM Tris, 5mM pME, 0,5mM Imidazole 

A') 6M Urea, 20inM Nap, pH 6.80, SOmMNaa 

B' ) 6M Urea, 20mM NaP, pH 6.20, 50mM NaCl, 1 2mM Imidazole 

C ) 6M Urea, 20mM NaP, pH 6.80, 50mM NaCl. 300mM Imidazole 

Note: Buffers A' and C were mixed at the ^propriate ratios to give intermediate 

concentrations of Imidazole. 

The 50ml of solubilized material was combined with 50ml of buffer A prior to adding to 
100- 1 25ml Qiagen Ni-NTA SupcrFlow (pre-equilibrated with buffer A) in a 5 x 10cm Bio- 
Rad econo column. This was shaken gently overnight at 4^C in the cold room. 
Chromatography Steps: 

1) Drained the resultant flow through. 

2) Washed with 50ml buffer A (collecting into flow through fraction) 

3) Washed with 250ml buffer A (wash I ) 

4) Washed with 250ml buffer A (wash 2) 

5) Washed with 250ml buffer A* 



53 



wo 00n7369 



PCT/US99/20881 



6) Washed with 250ml buffer B ' 

5 7) Washed with 250ml buffer A* 

8) Huted with 250ml 75mM Imidazole 

9) Huted with 250ml 1 50mM Imidazole ( 1 50- 1 ) 
5 10) Outed with 250ml 150mM Imidazole (150-2) 

1 1 ) Buted with 250ml 300mM Imidazole (300- 1 ) 

12) Elated with 250ml 300mM Imidazole (300-2) 

13) Bated with 250ml 300raM Imidazole (300-3) 

10 Chromatography Results: 

The rHuAsp eluted at 75mM Imidazole through 300mM Imidazole. The 75mM fraction, as 
well as the first 150mM Imidazole (150-1) fraction contained contaminating proteins as 
visualized on Coomassic Blue stained gels. Therefore, fractions 150-2 and 300-1 will be 
utilized for refolding experiments since they contained the greatest amount of protein (see 
20 15 Coomassic Blue stained gel). 

Refolding Experiments of rHuAsp2L: 
Experiment I: 

Forty ml of 150-2 was spiked with IM DTT, 3M Tris. pH 7.4 and DEA to a final 
concentration of 6niM, 50mM, and 0. 1 % respectively. This was diluted suddenly (while 

20 surring) with 200ml of (4°Q cold 20mM NaP, pH 6.8, 150mM NaCl. This dilution gave a 
final Urea concentration of IM. This solution remained clear, even if allowed to set open to 
30 the air at RT or at 4**C . 

After setting open to the air for 4-5 hours at 4*0, this solution was then dialyzed overnight 
against 20mM NaP, pH 7.4, 150mM NaCl, 20% glycerol. This method effectively removes 

25 the urea in the solution without precipitation of the protein. 

35 

Experiment 2: 

Some of the 150-2 cluate was concentrated 2x on an Amicon Centriprep, 10,000 NfWCO, 
then treated as in Experiment 1. This material also stayed in solution, with no visible 
40 precipitation. 
30 
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Experiment 3: 

89inl of the 150-2 eluate was spiked with IM DTT, 3M Tris, pH 7.4 and DEA to a final 
concentration of 6niM. 50inM, and 0.1% respectively. This was diluted suddenly (while 
stiiring) with 445ml of (4**C) cold 20mM NaP. pH 6.8, 150inM NaQ. This solution 
appeared clear, with do apparent precipitation. The solution was removed to RT and stirred 
for 10 minutes prior to adding MEA to a final concentration of 0. ImM. This was stirred 
slowly at RT for Ihr. Cystamine and CuSO* were then added to final concentrations of 
ImM and lO^lM respectively. The solution was stirred slowly at RT for 10 minutes prior to 
being moved to the A^C cold room and shaken slowly overnight, open lo the air. 

The following day, the solution (still clear, with no apparent precipitation) was 
ccntrifuged at 100,000 x g for 1 hour. Supematants from multiple riins were pooled, and 
the bulk of the stabilized protein was dialyzed against 20mM NaP, pH 7.4, 150mM NaCl, 
20% glycerol. After dialysis, the material was stored at -20°C. 

Some (about 10ml) of the protein solution (still in IM Urea) was saved back for 
biochemical analyses, and frozen at -20''C for storage. 

Example 10. Expression of Hu^AspZ and Derivatives in Insect Cells 
Expression by haculovirus infection — The coding sequence of Hu- Asp2 and several 
derivatives were engineered for expression in insect cells using the PCR. For the full- 
length sequence, a 5**sense oligonucleotide primer that modified the translation initiation 
site to fit the Kozak consensus sequence was paired with a 3'-antisense primer that contains 
the natural translation termination codon in the Hu-Asp2 sequence. PCR amplification of 
the pcDNA3.1(hygrD)/Hu-Asp2 template (see Example 12). Two derivatives of Hu-Asp2 
that delete the C-lcrminal transmembrane domain (SEQ ID No. 29 and No, 30) or delete the 
transmembrane domain and introduce a hexa-histidine tag at the C-terminus (SEQ ID No. 
31 and No. 32) were also engineered using the PCR. The same 5'-scnse oligonucleotide 
primer described above was paired with either a 3'-antisense primer that (1) introduced a 
translation termination codon after codon 453 (SEQ ID No. 3) or (2) incorporated a hexa- 
histidine tag followed by a translation termination codon in the PCR using 
pcDNA3.1(hygro)/Hu_Asp-2L as the template. In all cases, the PCR reactions were 
performed amplified for 15 cycles using Pwol DNA polymerase (Bochringcr-Mannhcim) as 
outlined by the supplier. The reaction products were digested to completion with BamHl 
and Notl and Ugated to BamHl and Notl digested baculovirus transfer vector pVL1393 
(Invitrogen). A portion of the ligations was used to transform competent £. eoli DH5a cells 
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followed by antibiotic selection on LB- Amp. Plasmid DNA was prepared by standard 
alkaline lysis and banding in CsCl to yield the baculovinis transfer vectors pVLI393/Asp2, 
pVLl 393/Asp2ATM and pVL1393/Asp2ATM{His)6. Creation of recombinant 
baculovinises and infection of sf9 insect cells was performed using standard methods. 
5 Expression by /riinjfccljon— Transient and stable expression of Hu-Asp2ATM and 

Hu-Asp2ATM(His)6 in High 5 insect cells was perfonncd using the insect expression vector 
pE/VS-His. The DNA inserts from the expression plasmids vectors pVL1393/Asp2, 
pVLl393/Asp2ATM and pVL1393/Asp2ATM(His)6 were excised by double digestion with 
BamHi and Notl and subcloned into BamHl and Notl digested pIZ/V5-His using standard 

10 methods. The resulting expression plasmids. referred to as pIZ/Hu-Asp2ATM and pIZ/Hu- 
Asp2ATM(His)6, were prepared as described above. 

For transfection. High 5 insect cells were cultured in High Five serum free medium 
supplemented with 10 jig/ml gentamycin at 27 "C in scaled flasks. Transfcctions were 
performed using High five cells. High five serum fi«e media supplemented with 10 jig/ml 

15 gentamycin, and InsectinPlus liposomes (Invitrogcn, Carlsbad, CA) using standard 
methods. 

For large scale transient transfcctions 1.2 x lO' high five cells were plated in a 150 
mm tissue culture dish and allowed to attach at room temperamre for 15-30 minutes. 
During the attachment time the DNA/ liposome mixture was prepared by mixing 6 ml of 

20 serum free media, 60 ^g Asp2ATM/pIZ (+/- His) DNA and 1 20 jil of Insectin Plus and 
incubating at room temperature for 15 minutes. The plating media was removed from the 
dish of ceils and replaced with the DNA/liposomc mixture for 4 hours at room temperature 
with constant rocking at 2 rpm. An additional 6 ml of media was added to the dish prior to 
incubation for 4 days at 27 "C in a humid incubator. Four days post transfection the media 

25 was harvested, clarified by ccntrifugation at 500 x g, assayed for Asp2 expression by 

Westem blotting. For stable expression, the cells were treated with 50 fig/ml Zeocin and 
the surviving pool used to prepared clonal cells by limiting dilution followed by analysis of 
the expression level as noted above. 

Purification ofHu'Asp2ATM and Hu-Asp2ATM(His)e — Removal of the 

30 transmembrane segment finm Hu-Asp2 resulted in the secretion of the polypeptide into the 
culmre medium. Following protein production by either baculovirus infection or 
transfection, the conditioned medium was harvested, clarified by centrifugation. and 
dialyzed against Tris-HCl (pH 8.0). This material was then purified by sucessive 
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chromaiography by anioa exchange (Tris-HCl, pH 8.0) followed by cation exchange 
chromatography (Acetate buffer at pH 4.5) using NaQ gradients. The clution profile was 
monitored by (1) Western blot analysis and (2) by activity assay using the peptide subsiraic 
described in-Examplc 12. For the Hu-Asp2ATM(His)e, the conditioned medium was 
dialyzcd against Tris buffer (pH 8.0) and purified by sequential chromatogr^hy on IMAC 
resin followed by anion exchange chromatography. 

Sequence analysis of the purified Hu- A5p2ATM(His)6 protein revealed that the signal 
peptide bad been cleaved [TQHGIRLPLR]. 

Example 11. Expression of Hu-Asp2 in CHO cells 



Heterologous expression of Hu Jisp-IL in CHO-Kl cells — The entire coding sequence of 
Hu-Asp2 was cloned into the mammalian expression vector pcDNA3.1(+)Hygro 
15 (Inviurogen, Carisbad, CA) which contains the CMV immediate early promotor and bGH 
poiyadenylation signal to drive over expression. The expression piasmid, 
pcDNA3.1(+)Hygro/Hu-Asp2, was prepared by alkaline lysis and banding in CsCl and 
completely sequenced on both strands to verify the integrity of the coding sequence. 

20 Wild-type Chinese hamster ovary cells (CHO-Kl ) were obtained from the ATCC. The 

cells were maintained in monolayer culture.«t in a-MEM containing 10% FCS at yfC in 5% 
CO2. Two 100 mm dishes of CHO-KI cells (60% confluent) were transfected with 
pcDNA3.1{+)/Hygro alone (mock) or pcDNA3.1(+)Hygro/Hu-Asp2 usmg the cationic 
liposome DOTAP as recommended by the supplier. The cells were treated with the piasmid 

25 DNA/Iiposome mixtures for 15 hr and then the medium replaced with growth medium 
containing 500 Units/ml hygromycin B. in the case of pcDNA3. l(+)Hygro/Hu-Asp2 
transfected CHO-Kl cells, individual hygromycin B-resistant cells were cloned by limiting 
dilution. Following clonal expansion of the individual cell lines, expression of Hu-Asp2 
protein was accessed by Western blot analysis using a polyclonal rabbit antiserum raised 
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against recombinant Hu-Asp2 prepared by expression in £. colL Near confluent dishes of 
each cell line were harvested by scraping into PBS and ihe cells recovered by 
centrifugalion. The cell pellets were resuspended in cold lysis buffer (25 mM Tris-HCl 
(S.OyS niM EDTA) coniaining protease inhibitors and the cells lyscd by sonication. The 
5 soluble and membrane fractions were separated by ccntrifugation ( 1 05,000 x g, 60 min) and 
normalized amounts of protein from each fraction were then separated by SDS-PAGE. 
Following electrotransfer of the separated polypeptides to PVDF membranes, Hu_Asp-2L 
protein was detected using rabbit aati-Hu-Asp2 antiserum (l/IOOO dilution) and the 
antibody-antigen complexes were visualized using alkaline phosphatase conjugated goat 

to anti-rabbit antibodies (1/2500). A specific immunoreactive protein with an apparent Mr 
value of 65 kDa was detected in pcDNA3.1(+)Hygro/Hu-Asp2 Iransfected cells and not 
mock-transfected cells. Also, the Hu-A5p2 polypeptide was only detected in the membrane 
fraction, consistent with the presence of a signal peptide and single transmembrane domain 
in the predicted sequence. Based on this analysis, done #5 had the highest expression level 

15 of Hu-Asp2 protein and this production cell lines was scaled up to provide material for 
purification. 

Purification of recombinant Hu _^p-2Lfrom CH0-Kl/Hu-Asp2 clone #3 — In a 
typical purification, clone #5 cell pellets derived from 20 150 mm dishes of confluent cells, 
were used as the starting material. The cell pellets were resuspended in 50 ml cold lysis 
20 buffer as ttescribed above. The cells were lysed by polytron homogenization (2 x 20 sec) 
- and the lysate ccntrifugcd at 338.000 x g for 20 minutes. The membrane pellet was then 
resuspended in 20 ml of cold lysis buffer containing 50 mM p-octylglucoside followed by 
rocking at 4*C for Ihr. The detergent extract was clarified by centrifugalion at 338,000 x g 
for 20 minutes and the supernatant taken for further analysis. 
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The p-octylgiucosidc extract was applied to a Mono Q anion exchange column that 
was previously equilibrated with 25 mM Tris-HCl (pH 8.0)/50 mM p-octylglucoside. 
Following sample application, the column was eluted with a linear gradient of increasing 
NaCl concentration (0-1.0 M over 30 minutes) and individual fractions assayed by Western 
blot analysis and for p-secrctase activity {see below). Fractions containing both Hu_Asp- 
2L immunoreactivity and p-secietase activity were pooled and dialyzed against 25 mM 
NaOAc (pH 4.5)/50 mM ^-octylglucoside. Following dialysis, precipitated material was 
removed by centrifugation and the soluble material chromatographed on a MonoS cation 
exchange column that was previously equilibrated in 25 mM NaOAc (pH 43)/ 50 mM p- 
octylglucoside. The column was eluted using a linear gradient of increasing NaCl 
concentration (0-1.0 M over 30 minutes) and individual fractions assayed by Western blot 
analysis and for p-$ecretasc activity. Fractions containing both Hu-Asp2 immunoreactivity 
and P-secretase activity were combined and determined to be >90% pure by SDS- 
PAGE/Coonussie Blue staining. 

Example 12. Assay of Hu-Asp2 p-secretase activity using peptide substrates 
P-secretase assay — P-secrctase activity was measured by quantifying the hydrolysis of a 
synthetic peptide containing the APP Swedish mutation by RP-HPLC with UV detection. 
Each reaction contained 50 mM Na-MES (pH 5.5), 1% p-octylglucoside, peptide substrate 
(SEVNLDAEFR, 70 pM) and enzyme ( 1-5 protein). Reactions were incubated at 37 ''C 
for various times and the reaction products were resolved by EIP-HPLC using a linear 
gradient from 0-70 B over 30 minutes (A=0.1% TFA in water, 

B=).1%TFA/ 10% water/90% AcCN). The elution proCle was monitored by absorbance at 
214 Dm. In preliminary experiments, the two product peaks which eluted before the intact 
peptide substrate, were confirmed to have the sequence DAEFR and SEVNL using both 
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Edman sequencing and MADU-TOF mass spectromcny. Percent hydrolysis of the peptide 

substrate was calculated by comparing the integrated peak areas for the two product 

peptides and the starting material derived from the absorbance at 214 nm. The specificity 

of the protease cleavage reaction was determined by performing the ft-sccretase assay in the 

5 presence of a cocktail of protease inhibitors (8 pM pepstaiin A, 1 0 jiM leupeptin, 1 0 \iM 

E64.and5mM EDTA). 

An alternative p-secretase assay utilizes internally quenched fluorescent substrates 
to monitor enzyme activity using fluorescence spectroscopy in a single sample or multiwcll 
format. Each reaction contained 50 mM Na-NfES {pH 5.5), peptide substraie MCA- 

10 EVKMDAEFtK-DNPJ (BioSource International) (50 ^M) and purified Hu-Asp-2 enzyme. 
These components were equilibrated to 37 *C for various times and the reaction initiated by 
addition of substrate. Excitation was performed at 330 nm and the reaction kinetics were 
monitored by measuring the fluorescence emission at 390 nm. To detect compounds that 
modulate Hu-Asp-2 activity, the test compounds were added during the preincubation phase 

15 of the reaction and the kinetics of the reaction monitored as described above. Activators are 
scored as compounds that increase the rate of appearance of fluorescence while inhibitors 
decrease the rate of appearance of fluorescence. 

It will be clear that the invention may be practiced otherwise than as particularly 
described in the foregoing description and examples. 

20 Numerous modifications and variations of the present invention are possible in tight of the 
above teachings and, therefore, are within the scope of the invention. 
The entire disclosure of all publications cited herein are hereby incorporated by reference. 
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What is claimed is: 

1 . Any isolated or puriried nucleic acid polynucleotide that codes for a protease 
capable of cleaving the beta (p) sccrctase cleavage site of APP that coniairu; two or 
niore sets of special nucleic acids, where the special nucleic acids are separated by 
nucleic acids that code for about 100 to 300 amino acid positions, where the amino 
acids in those positions may be any anuno acids, where the first set of special 
nucleic acids consists of the nucleic acids that code for the peptide DTG, where the 
first nucleic acid of the first special set of nucleic acids is, the first special nucleic 
acid, and where the second set of nucleic acids code for cither the peptide DSG or 
DTG, where the last nucleic acid of the second set of nucleic acids is the last special 
nucleic acid, with the proviso that the nucleic acids disclosed in SEQ ID NO. I and 
SEQ. ID NO. 5 are not included. 

2. The nucleic acid polynucleotide of claim 1 where the two sets of nucleic acids are 
squirated by nucleic acids that code for about 125 to 222 amino acid positions, 
which may be any amino acids. 

3. The nucleic acid polynucleotide of claim 2 diat code for about 150 to 172 amino 
acid positions, which may be any amino acids. 

4. Th& nucleic acid polynucleodde of claim that code for about 172 amino acid 
positions, which may be any amino acids. 

5. The nucleic acid polynucleotide of claim 4 where the nucleotides are described in 
SEQ. ID. NO. 3 

6. The nucleic acid polynucleotide of claim 2 where the two sets of nucleic acids are 
separated by nucleic acids thai code for about 150 to 196 amino acid positions. 

7. The nucleic acid polynucleotide of claim 6 where the two sets of nucleotides are 
separated by nucleic acids that code for about 196 amino acids (positions). 
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8. The nucleic acid polynucleotide of claim 7 where the two sets of nucleic acids are 
separated by the same nucleic acid sequences that separate the same set of special 
nucleic acids in SEQ. ID. NO. 5. 

5 9. The nucleic acid polynucleotide of claim 4 where the two sets of nucleic acids are 
separated by nucleic acids th^t code for about 150 to 190, amino acid (positions). 

10. The nucleic acid polynucleotide of claim 9 where the two sets of nucleotides are 
separated by nucleic acids that code for about 190 amino acids (positions). 

to 

1 1 . The nucleic acid polynucleotide of claim 1 0 where the two sets of nucleotides are 
separated by the same nucleic acid sequences that separate the same set of special 
nucleotides in SEQ. ID. NO. 1. 

15 12. Claims 1 - II where the first nucleic acid of the first special set of amino acids, that 
is. the first special nucleic acid, is operably linked to any codon where the nuclic 
acids of that codon codes for any peptide comprising ^m 1 to lO.CXX) ammo acid 
(positions). 

20 13. The nucleic acid polynucleotide of claims 1-12 where the first special nucleic acid is 
operably linked to nucleic acid polymers that code for any peptide selected from the 
group consisting of: any any reporter proteins or proteins which facilitate 
purification. 

25 14. The nucleic acid polynucleotide of claims I - 1 3 where the first special nucleic acid is 
operably linked to nucleic acid polymers that code for any peptide selected from the 
group consisting of: immunoglobin-heavy chain, maltose binding protein, gtutathion 
S transfecrion, Green Fluorescent protein, and ubiquitin. 

30 15. Claims 1-14 where the last nucleic acid of the second set of special amino acids, that 
is, the last special nucleic acid, is operably linked to nucleic acid polymers that code 
for any peptide comprising any amino acids from 1 to 10,(XX) amino acids. 
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16- Claiins 1-15 where the last special nucleic acid is opcrably linked to any codon 
linked to nucleic acid polymers thai code for any peptide selected from the group 
consisting of: any reporter proteins or proteins which facilitate purification. 

5 17. The nucleic acid polynucleotide of claims 1-16 where the first special nucleic acid is 
operably linked to nucleic acid polymers that code for any peptide selected from the 
group consisting of: immunoglobin-faeavy chain, maltose binding protein, glutathion 
S transfection. Green Fluorescent protein, and ubiquitin. 

10 18. * Any isolated or purified nucleic acid polynucleotide that codes for a protease 
capable of cleaving the beta secretase cleavage site of APP that contains two or 
more sets of special nucleic acids, where the special nucleic acids are separated by 
nucleic acids that code for about 100 to 300 amino acid positions, where the amino 
acids in those positions may be any amino acids, where the first set of special 

IS nucleic acids consists of the nucleic acids that code for DTG, where the first nucleic 

acid of the first special set of nucleic acids is, the first special nucleic acid, and 
where the second set of nucleic acids code for either DSG or DTG, where the last 
nucleic acid of the second set of special nucleic acids is the last special nucleic acid, 
where the ftrst special nucleic acid is oper^ly linked to nucleic acids that code for 

20 any number of amino acids from zero to 8 1 amino acids and where each of those 

radons may code for any amino acid. 

19. The nucleic acid polynucleotide of claim 18 , where the first special nucleic acid is 
opcrably linked to nucleic acids that code for any number of from 64 to 77 amino 

25 acids where each codon may code for any amino acid. 

20. The nucleic acid polynucleotide of claim 19 , where the first special nucleic acid is 
opcrably linked to nucleic acids that code for about 71 amino acids peptide. 

30 21. The nucleic acid polynucleotide of claim 20, where the first special nucleic acid is 
operably linked to 71 amino acid peptide and where the first of those 7 1 amino acids 
is the amino acid T. 
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22. The nucleic acid poiynucleotide of claim 21, where the polynucleotide comprises a 
sequence that is at least 95% identical to the same corresponding amino acids in 
SEQ. ID. NO, 3, that is. identical to the sequences in SEQ. ID. NO. 3 including the 
sequences from both the first and or the second special nucleic acids, toward the N- 
5 Terminal, through and including 71 amino acids, see Example 10, beginning from 

the DTG site and including the nucleotides from thai code for 71 amino acids). 



23. The nucleic acid polynucleotide of claim 22. where the complete polynucleotide 

comprises identical to the same corresponding amino acids in SEQ. ID. NO. 3, that 
10 is, identical to the sequences in SEQ. ID. NO. 3 including the sequences from both 

the first and or the second special nucleic acids, toward the N-Tcrminal, through 
and including 71 amino acids, see Example 10, beginning from the DTG site and 
including the nucleotides from that code for 71 amino acids). 



15 24. The nucleic acid polynucleotide of claim 18 , where the first special nucleic acid is 
operably linked to nucleic acids that code for any number of from about 30 to 54 
amino acids where each codon may code for any amino acid. 

25. The nucleic acid polynucleotide of claim 20. where the first special nucleic acid is 
20 operably linked to 47 codons where the first those 35 or 47 amino acids is the 

amino acid £ or G. 



26. The nucleic acid polynucleotide of claim 2 1 , where the polynucleotide comprises a 
sequence that is at least 95% identical to the same corresponding amino acids in 

25 SEQ. ID. NO. 3, that is, identical to that portion of the sequences in SEQ. ID. NO. 

3 including the sequences from both the first and or the second special nucleic 
acids, toward the N-Terminal, through and including 35 or 47 amino acids, sec 
Example 1 1 for the 47 example, beginning from the DTG site and including the 
nucleotides from that code for the previous 35 or 47 amino acids before the DTG 

30 site). 
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27. The nucleic acid polynucleotide of claim 22, where the polynucleotide comprises 
identical to the same corresponding amino acids in SEQ. ID. NO. 3, that is. 
identical to the sequences in SEQ. ID. NO. 3 including the sequences from both the 
fiistand or the second special nucleic acids, toward the N-Terminal, through and 
includiag 35 or 47 amino acids, see Example 1 1 for the 47 example, beginning from 
the DTG site and including the nucleotides from that code for the previous 35 or 47 
amino acids before the DTG site). 

28. * Any isolated or purified nucleic acid polynucleotide that codes for a protease 
capable of cleaving the beta (p) secreiase cleavage site of APP that contains two or 
more sets of special nucleic acids, where the special nucleic acids are separated by 
nucleic acids that code for about 100 to 300 amino acid positions, where the amino 
acids in those positions may be any amino acids, where the first set of special 
nucleic acids consists of the nucleic acids that code for the peptide DTG, where the 
first nucleic acid of the first special set of amino acids is, the first special nucleic 
acid, and where the second set of special nucleic acids code for cither the peptide 
DSG or DTG, where the last nucleic acid of the second set of special nucleic acids, 
the last special nucleic acid, is operably linked to nucleic acids that code for any 
number of codons horn 50 to 170 codons. 

29. The nucleic acid polynucleotide of claim 29 where the last special nucleic acid is 
operably linked to nucleic acids comprising from 100 to 170 codons. 

30. The nucleic acid polynucleotide of claim 30 where the last special nucleic acid is 
operably linked to nucleic acids comprising from 142 to 163 codons. 

3 1 . The nucleic acid polynucleotide of claim 3 1 where the last special nucleic acid is 
operably linked to nucleic acids comprising about 142 codons. 



32. 



The nucleic acid polynucleotide of claim 32 where the polynucleotide comprises a 
sequence that is at least 95% identical to SEQ. ID. # (Example 9 or 10). 
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33 . Tbe nucleic acid polynucleotide of claim 33, where ihe complete polynucleotide 
comprises SEQ. E>. U (Example 9 or tO). 

34. Thc-nucieic acid polynucleotide of claim 3 1 where the last special nucleic acid is 
5 opcrably linked to nucleic acids comprising about 163 codons. 

35. The nucleic acid polynucleotide of claim 35 where the polynucleotide comprises a 
sequence that is at least 95% identical to SEQ. ID. # (Example 9 or 10). 

10 36. The nucleic acid polynucleotide of claim 36, where the complete polynucleotide 
comprises SEQ. ID. # (Example 9 or 10). 

37. The nucleic acid polynucleotide of claim 3 1 where the last special nucleic acid is 
operably linked to nucleic acids comprising about 170 codons. 

15 

38. Claims 1-38 where the second set of special nucleid acids code for the peptide DSG, 
and optionally the first set of nucleic acid polynucleotide is operably linked to a 
peptide purification tag. 

20 39. Claims 1-39 where the nucleic acid polynucleotide is operably linked to a peptide 
purification tag which is six histidine. 

40. Qaims MO where the first set of special nucleic acids are on one polynucleotide 
and die second set of special nucleic acids are on a second polynucleotide, where 

25 both first and second polynucleotides have at lease 50 codons. 

41. Claims 1-40 where the first set of special nucleic acids are on one polynucleotide 
and the second set of special nucleic acids are on a second polynucleotide, where 
both first and second polynucleotides have at lease 50 codons where both said 

30 polynucleotides are in the same solution. 

42. A vector which contains a polyiuicleotide described in claims I -42. 
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43. A cell or cell line which contans a polynucleotide described in claims 1 -42. 



44. Any isolated or purified peptide or protein comprising an amino acid polymer that is 
a protease capable of cleaving the beta (p) secrctasc cleavage site of APP that 
contains two or more sets of special amino acids, where the special amino acids are 
separated by about 100 to 30Q amino acid positions, where each amino acid 
position can be any amino acid, where the first set of special amino acids consists of 
the peptide DTG, where the first amino acid of the first special set of amino acids is, 
the first special amino acid, where the second set of amino acids is selected from the 
peptide comprising cither DSG or DTG, where the last amino acid of the second set 
of special amino acids is the last special amino acid, with the proviso that the 
proteases disclosed in SEQ ID NO. 2 and SEQ. ID NO. 6 are not included. 

45. The amino acid polypeptide of claim 45 where the two sets of amino acids arc 
separated by about 125 to 222 amino acid positions where in each position it may be 
any amino acid. 

46. The amino acid polypeptide of claim 46 where the two sets of amino acids are 
separated by about 150 to 172 amino acids. 

47. The amino acid polypeptide of claim 47 where the two sets of amino acids are 
separated by about 172 amino acids. 

48- The amino acid polypeptide of claim 48 where the protease is described in SEQ. ED. 
NO. 4 

49. The amino acid polypeptide of claim 46 where the two sets of amino acids are 
separated by about 150 to 196 amino acids. 

50. The amino acid polypeptide of claim 50 where the two sets of amino acids arc 
separated by about 196 amino acids. 
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5 1 . The amino acid polypeptide of claim 5 1 where the two sets of amino acids are 
separated by the same amino acid sequences that separate the same set of special 
amino acids in SEQ. ID. NO. 6. 

52. The amino acid polypeptide of claim 46 where the two sets of amino acids arc 
separated by about 150 to 190. amino acids. 

53. The amino acid polypeptide of claim 53 where the two sets of nucleotides are 
separated by about 190 amino acids. 

54. The amino acid polypeptide of claim 54 where the two sets of nucleotides are 
separated by the same amino acid sequences that separate the same set of special 
amino acids in SEQ. ID. NO. 2. 

55. Claims 45-55 where the fust amino acid of the first special set of amino acids, that 
is, the first special amino acid, is operably linked to any peptide comprising from 1 
to 10,000 amino acids. 

56. The amino acid polypeptide of claims 45-56 where the first special amino acid is 
operably linked to any peptide selected from the group consisting of: any any 
reporter proteins or proteins which facilitate purification. 

57. The amino acid polypeptide of claims 45-57 where the first special amino acid is 
operably linked to any peptide selected from die group consisting of: 
immunoglobin-faeavy chain, maltose binding protein, glutathion S transfeciion, 
Green Fluorescent protein, and ubiquitin. 

58. Claims 45-58, where the last amino acid of the second set of special amino acids, 
that is. the last special amino acid, is operably linked to any peptide comprising any 
amino acids ^m 1 to 10.000 amino acids. 
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59. Qaims 45-59 where the last special amino acid is operably hnkcd any peptide 
selected from the group consisting of any rcponer proteins or proteins which 
facilitate purification. 

5 60. The amino acid polypeptide of claims 45-60 where the firsi special amino acid is 
operably linked to any peptide selected from the group consisting of : 
immunoglobin-heavy chain, maltose binding protein, glutathion S transfection. 
Green Fluorescent protein, and ubiquitin. 

10 61. * Any isolated or purified peptide or protein comprising an amino acid 

polypeptide thai codes for a protease capable of cleaving the beta secretasc cleavage 
site of APP that contains two or more sets of special amino acids, where the special 
amino acids are separated by about 100 to 300 amino acid positions, where each 
amino acid in each position can be any amino acid, where the first set of special 

15 amino acids consists of the amino acids DTG, where the first amino acid of the first 

special set of amino acids is, the Bist special amino acid, D, and where the second 
set of amino acids is either DSG or DTG, where the last amino acid of the second 
set of special amino acids is the last special amino acid, G, where the first special 
amino acid is operably linked to amino acids that code for any number of amino 

20 acids from zero to 8 1 amino acid positions where in each position it may be any 

amino acid. 

62. The amino acid polypeptide of claim 62, where the first special amino acid is 
operably linked to a peptide from about 30 to 77 amino acids positions where each 

75 amino acid position may be any amino acid. 

63. The amino acid polypeptide of claim 63, where the first special amino acid is 
operably linked to a peptide of 35, 47. 7 1 , or 77 amino acids. 

30 64. The amino acid polypeptide of claim 63. where the first special amino acid is 

operably linked to the same corresponding peptides from SEQ. ID. NO. 3 that are 
35, 47, 71, or 77 peptides in length, beginning counting with the amino acids on the 
first special sequence, DTG, towards the N-terminal of SEQ. ID. NO. 3. 

69 
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65. The amino acid polypeptide of claim 65, where the polypeptide comprises a 
sequence that is at least 95% identical to the same conrsponding amino acids in 
SEQ. ID. NO. 4, that is, identical to that portion of the sequences in SEQ.E). NO. 4, 

S including all the sequences from both the fust and or the second special nucleic 

acids, toward the N- terminal, through and including 71 . 47, 35 amino acids before 
the first special amino acids. (Examples 10 and 11). 

66. The amino acid polypeptide of claim 65, where the complete polypeptide comprises 
10 the peptide of 7 i amino acids, where the first of the amino acid is T and the second 

isQ. 

67. The amino acid polypeptide of claim 62, where the first special amino acid is 
opcrably linked to any number of from 40 to 54 amino acids (positions) where each 

IS amino acid position may be any amino acid. 

68. The amino acid polypeptide of claim 68, where the lirst special amino acid is 
opcrably linked to amino acids that code for a peptide of 47 amino acids. 

20 69. The amino acid polypeptide of claim 69, where the first special amino acid is 

operably linked to a 47 amino acid peptide where the first those 47 amino acids is 
the amino acid E. 

70. The amino acid polypeptide of claim 70, where the polypeptide comprises a 
25 sequence that is at least 95% identical to SEQ. ID. # (Example 10). 

7 1 . The amino acid polypeptide of claim 7 1 , where the complete polypeptide comprises 
SEQ. ID. # (Example 10). 

30 72. * Any isolated or purified amino acid polypeptide that is a protease capable of 
cleaving the beta (P) secretase cleavage site of APP that contains two or mors sets 
of special amino acids, where the special amino acids are separated by about 100 to 
300 amino acid positions, where each amino acid in each position can be any amino 
70 
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acid, where the first set of special amino acids consists of the amino acids that cods 
for DTG, where the first amino acid of the first special set of amino acids is, the 
first special amino acid, and where the second set of amino acids arc cither DSC 
or DTG, where the last amino acid of the second set of special amino acids is the 
last special amino acid. G, which is operably linked to any number of amino acids 
from 50 to 170 amino acids, which may be any amino acids. 

73. The amino acid polypeptide of claim 73 where the last special amino acid is 
operably linked to a peptide of about 100 to 170 amino acids. 

74. The amino acid polypeptide of claim 74 where the last special amino acid is 
operably linked to to a peptide of about 142 to 163 amino acids. 

75. The amino acid polypeptide of claim 75 where the last special amino acid is 
operably linked to to a peptide of about about 142 amino acids. 

76. The amino acid polypeptide of claim 76 where the polypeptide comprises a 
sequence that is at least 95% identical to SEQ. E>. # (Example 9 or 10). 

77. The amino acid polypeptide of claim 75 where the last special amino acid is 
operably linked to a peptide of about 163 amino acids. 

78. The amino acid polypeptide of claim 79 where the polypeptide comprises a 
sequence that is at least 95% identical to SEQ. ID. # (Example 9 or 10). 

79. The amino acid polypeptide of claim 79, where the complete polypeptide comprises 
SEQ. ID. # (Example 9 or 10). 

80. The amino acid polypeptide of claim 74 where the last special amino acid is 
operably linked to to a peptide of about 170 amino acids. 

8 1 . Claim 46-8 1 where the second set of special amino acids is comprised of the peptide 
with the amino acid sequence DSG. 

71 



wo 00/17369 



PCTAJS99/2088I 



82. Claims 45-82 where the amino acid polypeptide is opcrably linked to a peptide 
purification tag. 

83. Qaims 45-83 where the amino acid polypeptide is opcrably linked to a peptide 
purification tag which is six histidine. 

84. Claims 45-84 where the first set of special amino acids arc on one polypeptide and 
the second set of special amino acids are on a second polypeptide, where both first 
and second polypeptide have at lease 50 amino acids, which may be any amino 
acids. 

85. Claims 45-84 where the first set of special amino acids are on one polypeptide and 
the second set of special amino acids are on a second polypeptide, where both fust 
and second polypeptides have at lease 50 amino acids where both said polypeptides 
are in the same vessel. 

86. A vector which contains a polypeptide described in claims 45-86. 

87. A cell or cell line which contans a polynucleotide described in claims 45-87. 

88. The process of making any of the polynucleotides, vectors, or cells of claims 1-44 

89. The process of making any of the polypeptides, vectors: or cells of claims 45-88 

90. Any of the polynucleotides, polypeptides, vectors, cells or cell lines described in 
claims 1-88 made from the processes described in claims 89 and 90. 



91. ♦ An isolated nucleic acid molecule comprising a polynucleotide, said 
polynucleotide encoding a Hu-Asp polypeptide and having a nucleotide sequence at least 
95% identical to a sequence selected from the group consisting of: 
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(a) a nucleotide sequence encoding a Hu-Asp polypeptide selected from 
the group consisting of Hu-Asp 1. Hu-Asp2(a), and Hu-Asp2(b), wherein said Hu-Asp 1. 
Hu-Asp2(a) and Hu-Asp2(b) polypeptides have the complete amino acid sequence of SEQ 
ID N0:2. SEQ ID NO;4, and SEQ ID No:6, rcspeciively; and 

(b) a nucleotide sequence complemcntaiy to the nucleotide sequence 

of(a). 

92. Tte nucleic acid molecule of claim 92, wherein said Hu-Asp polyi)cptide is Hu- 
Aspl. and said polynucieoiide molecule of 1(a) comprises the nucleotide sequence of SEQ 
IDN0:1. 

93. The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is Hu- 
Asp2(a). and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of 
SEQIDN0:4. 

94. The nucleic acid molecule of claim 92, wherein said Hu-Asp polypeptide is Hu- 
Asp2(b), and said polynucleotide molecule of 1(a) comprises the nucleotide sequence of 
SEQIDNO:5. 

95. An isolated nucleic acid molecule comprising polynucleotide which hybridizes 
under stringent conditions to a polynucleotide having the nucleotide sequence in (a) or (b) 
of claim 92. 

96. A veaor comprising the nucleic acid molecule of claim 96. 

97. The vector of claim 97, wherein said nucleic acid molecule is opcrably linked to a 
promoter for the expression of a Hu-Asp polypeptide. 

98. The vector of claim 97, wherein said Hu-Asp polypeptide is Hu-Asp 1 . 

99. The vector of claim 97, wherein said Hu-Asp polypeptide is Hu-Asp2(a). 

too. The vector of claim 97, wherein said Hu'Asp polypeptide is Hu-Asp2(b). 
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101 . A host cell comprising the vector of claim 98. 

102. A method of obtaining a Hu-Asp polypeptide comprising culturing the host cell of 
claim 102 and isolating said Hu-Asp polypeptide. 

103. An isolated Hu-Asp I polypeptide comprising an amino acid sequence at least 95% 
identical to a sequence comprising the amino acid sequence of SEQ ID NO:Z 

104. An isolated Hu-Asp2(a) polypeptide comprising an amino acid sequence at least 
95% identical to a sequence comprising the amino acid sequence of SEQ ID NO:4. 

105. An isolated Hu-A5p2(a) polypeptide comprising an amino acid sequence al least 
95% identical to a sequence comprising the amino acid sequence of SEQ ID NO:8. 

106 An isolated antibody that binds specifically to the Hu-Asp polypeptide of any of 
claims 104-107. 

sequence comprising the amino acid sequence of SEQ ID NO:8. 

107 An isolated antibody that binds specifically to the Hu-Asp polypeptide of any of 
claims 104-107. 

108 . * A method to identify a cell that can be used to screen for inhibitors of P 
sccretase activity comprising: 

a) identifying a cell that expresses a protease ci^)able of cleaving APP at the p 

sccretase site. 

comprising: 

i) collect the cells or the supemantent from the cells to be identified 

ii) measure the production of a critical peptide, where the critical 
peptide is selected from the group consisting of either the APP C- 
teiminal peptide or soluble APP, 

iii) select the cells which produce the critical peptide. 
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109. The method of ciaitn 1 08 where the cells are collected and the critical peptide is the 
APP C-tenninal peptide created as a result of the p secrclase cleavage. 



1 10. The method of claim 108 where the supcraantcnt is collected and the critical peptide 
5 is soluble APP where the soluble APP has a C-tcnninal created by P sccretasc cleavage. 

111. The method of claim 108 where the cells contain any of the nucleic acids or 
polypeptides of claims 1-86 and where the cells are shown to cleave the p secrctase site of 
any peptide having the following peptide structure. P2. Pi, PI'. P2*, where P2 is K or N, 

10 where PI is M or L, where PI* is D, where P2' is A. 

112. The method of claim ill where P2 is K and PI is M. 

113 The method of claim 112 where P2 is N and PI is L. 

15 

114 * Any bacterial cell comprising any nucleic aciis or peptides in claims 1-86 
and 92-107. 

115 A bacterial cell of claim 1 14 where the bacteria is E coli. 

20 

1 16 Any eukaryotic cell comprising any nucleic acids or polypeptides in claims 1 -86 and 
92-107. 

117 * Any insect cell comprising any of the nucleic acids or polypeptides in claims 
25 1-86 and 92-107. 

1 18 A insect cell of claim 1 17 where the insect is sf9, or High 5. 

119 A insect cell of claim 100 where the insect cell is High 5. 

30 

120 A mammalian cell comprising any of the nucleic acids or polypeptides in claims l- 
86 and 92-107. 
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121 A mammalian cell of claim 120 wheic tfae mammalian cell is selected from the 
group consisting of, human, rodent, lagomoipb, and primate. 

122 A mammalian cell of claim 121 where the mammalian celt is selected from the 
5 group consisting of human cell. 

123 A mammalian cell of claim 122 where the human ceil is selected fmm the group 
comprising HEK293. and IMR-32. 

10 124 A mammalian cell of claim 12 1 where the cell is a primate cell. 

125 A primate cell of claim 124 where the primate cell is a COS-7 cell. 

126 A mammalian cell of claim 12 1 where cell is selected from a rodent cells. 

15 

127 A rodent cell of claim 126 selected from, CHO-Kl . Neuro-2A. 3T3 cells. 

128 A yeast cell of claim 115. 

20 129 An avian cell of claim 115. 

1 30. ♦ Any isoform of APP where the last two carboxy terminus amino acids of th^ 
isoform are both lysine residues. 

25 131 The isoform of APP from claim 1 30 comprising the isoform known as APP695 
modified so that its last two having two lysine residues as its last two carboxy tenninus 
amino acids. 

132 The isofonn of claim 131 comprising SEQ.ID. 16. 

30 

133 The isoform variant of claim 130icomprising SEQ. ID. NO. 18, and 20. 
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1 34 Any cukaiyotic cell line, comprising nucleic acids or polypeptides of claim 130- 
133. 

135 Any cell line of claim 134 that is a mammaliam cell line (HEK293, Neuro2a, arc 
prcfeired plus any others.) 

136 A method for identifying inhibitors of an enzyme thai cleaves the beta secretase 
cleavabe site of APP comprising: 

a) culturing cells in a culture medium under conditions in which the enzyme 
causes processing of APP and release of amyloid beta-peptide 
into the medium and causes the accumulation of CTF99 fragments of APP in cell 
lysates. 

b) exposing the cultured cells to a test compound; and spcci fically 
determining whether the test compound inhibits the fiinction of the enzyme by 
measuring the amount of amyloid beta-peptide released into the 

medium and or the amount of CTF99 fragments of APP in cell lysates; 

c) identifying test compounds diminishing the amount of soluble amyloid beta 
peptide present in the culture medium and diminution of CTF99 fragments of APP in cell 
lysates as Asp2 inhibitors. 

137 The method of claim 136 wherein the culmred cells are a human, rodent or insect 
cell line. 

1 38 The method of claim 1 37 wherein the human or rodent cell line exhibits P secretase 
activity in which processing of APP occurs with release of amyloid beta-peptide into the 
culture medium and accimiulation of CTF99 in cell lysates. 

139. A method as in claim 1 38 wherein the human or rodent cell line treated with the 
antisense oligomers directed against the enzyme that exhibits P secretase activity, reduces 
release of soluble amyloid beta-peptide into the culture medium and accumulation of 
CTF99 in cell lysates. 
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140. A method for the idcatification of an agent that decreases the activity of a Hu-Asp 
polypeptide selected from the group consisting of Hu-Asp 1, Hu-Asp2(a). and Hu-Asp2(b), the 
method comprising 

(a) - • dctcnnining the activity of said Hu-Asp polypeptide in the presence of a test 
agent and in the absence of a test agent; and 

(b) comparing the activity of said Hu-Asp polypeptide determined in the 
presence of said test agent to the activity of said Hu-Asp polypeptide 
detennined in the absence of said test agent; 

whereby a lower level of activity in the presence of said test agent than in the absence of said 
test agCTt indicates that said test agent has decreased the activity of said Hu-Asp polypepdde.. 



141 . The nucleic acids, peptides, proteius. vectors, cells and cell lines, and assays described 
herein. 
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ncURE 1 (1) 

ATaSGCGCACTCGCCCGGCSCGCTGCTCCTGCCTCTQCTGGCCCAGTCKSCrc^ 
KGALARALLLPLLACWLLRA 

CCCCGGAGCTGGCCCCCCCCCCCTTCACGCTGCCC2TCCGGGTGGCCGCGGCCACGAAC 
APSLAPAPFTL.PLRVAAATN 

CGCGTACrrTGCGCCCACCCCGGGACCCGGGACCCCTGCCGAGCCCCACCCCGACGGCTTC 
RVVAPTPGPGTPAERHADGL 

GCGCTCGCCCTGGAGCCTGCCCTGGCGTCCCCCGCGGGCGCCGCCAACTTCTTGGCCATG 
ALALEPALASPAGAANFLAM 

GTAGACAACCTGCAGGGGGACTCTGGCCGCGGCTACTACCTGGAGATGCTGATCGGGACC 
VDNL QGDSGRGYVLEMLIGT 

CCCCCGCAGAAGCTACAGATTCTCGTTGACACTGGAAGCAGTAACTTTGCCGTGGCAG^ 
PPQKLQILVDTGSSNFAVAG 

ACCCCGCACTCCTACATAGACACGTACTTTGACACAGAGAGGTCTAGCACATACCGCTCC 
TPHSYIDTYFDTERSSTYRS 

AAGGGCTTTQACGTCACAGTCAAGTACACACAAGGAAGC?GGACGGGCTTCG^ 
KGFDVTVKYTQGSWTGFVGE 

GACCTCGTCACCATCCCCAAAGGCTrcAATACTTCTTTTCTTGTCAACATT^ 
DLVTI PKGFNTSFLVNIATI 

TTTGAATCAGAGAATTTCTTTTTGCCTGGGATrAAATGGAATCGAAT^^ 
FESENFFLPGIKWNGILGLA 

TRTGCCACACTTGCCAAGCCATCAAGTTCTCTGGAGACCTTCTTCGACTCCCTGG^ 
YATLAKPSSSLETFFDSLVT 

CAAGCAAACATCCCCAACGTTTTCTCCATGCAGATGTGTGGAGCCGGCTTGCCCGITGCT 
QANIPKVFSMQMCGAGLPVA 

GGATCTGGGACCAACGGAGGTAGTCTTGTCTITOGTGGAATTGAACCAAGTT^ 
GSGTNGGSLVLGGIEPSLYK 

GGAGACATCTCGTATACCCCTATTAAGGAAGAGTGGTACTACCAGATAGAAATTCTGAAA 
GDIWYTPIKEEWYYQIEILK 

TTGGAAATTQaAGGCCAAAGCCTTAATCTCGACTGCAGAGAGTATAACCCAGACAAGGCC 
LEIGGOSLNLDCREYNADKA 

ATCGTOJACAGTGGCACCACGCTGCTGCGCCTCCCCCAGAAGGTGTTTGATGCGGTGGTG 
IVDSGTTLLRLPQKVFDAVV 

GAAGCTGTGGCCCGCGCATCTCTXSATTCCAGAATTCTCTGATGGITTC^^ 
EAVAHASLIPEFSDGFWTGS 



QLACWTNSETPWSYFPKI SI 

TACCTGAGAGATGAGAACTCCAGCAGGTCArPCCGTATCACAATCCTGCCTCAGCTTTAC 
YLRDENSSRSFRITILPQLY 

ATTCAGCCCATGATGGGGGCCGGCCTGAATTATGAATGlTACCGATTCGGCTiTTTCCCCA 
laPMMGAGLNYECYRFGISP 

TCCACAAATGCGCTGGTGATCGGTGCCACGGTGATGGAGGGCTIOTACGrcATCTTCGAC 
STNALVIGATVMEGFYVIFD 

AGAGCCCAGAAGAGGGTtKKCTTCGCAGCGAGCCCCTGTGCAGAAATTGa^GGTGCTGCA 
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notms 1 (2) 



RAQKRVGFAASPCAEIAGAA 

GTGTCTGAAATTrcCGGGCCTTTCTCAACAGACGATGTAGCCAGCAACTCrTC 
VSEISGPFSTEDVASNCVPA 

CAGTCTTnSAGCGAGCCCATTTTCTRKATTGTCTCCTATSCGCTCATC 
QSLSEPl LVJ.IVSYALKSVCG 

GCCATCCTCCTTGTCTTAATCGTCCTGCTGCTGCrrGCCGTTCCGGTGTC^ 
AILLVLIVLLLLPFRCQRRP 

CGTGACCCTGAGGTCGTCAATGATGACTCCTCTCItKTCAGACATCGCTGGAAA 
RDPEVVNDESSLVRHRWK 

GCCAGGCCTGACCTCAAGCAACCATGAACTCAGCTATTAAGAAAATCACATITCCAGGGC 

AGCAGCCGGGATCGATGGT3GCGC?TTCTCCTG?GCCCACCC3TCrrCAA TCTCT GTTCT 

GCTCCCAGATCCCTTCTAGATTCACTGTCrrTTI^TTCTTCATT^^ 

CTCCCTACTTCCAAGAAAAATAATTAAAAAAAAAACTTCATTCTAAACCAAAAAAAAAAA 

AAAA 
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riOURZ 3 (l) 



ATGGCCCAAGCCCTGCCCTGGCT^CTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 
MAQALPWLLLWMGAGVLPAH 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 
GTQHGIRLPLRSGLGGAPLG 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 
LRLPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 
VEMVDNLRGKSGQGYYVEMT 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 
VGSPPQTLNILVDTGSSNPA 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 
VGAAPHPFLHRYYQRQLSST 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 
YRDLRKGVYVPYTQGKWEGE 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 
LGTDLVSIPHGPNVTVRANI 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTG 
AAITESDKFFIKGSNWEGIL 

GGGOTGGCCTATGCTGAGATTGCCAGGCTTTGTGGTCCTGGCTTCCCCCTCAACCA^ 
GLAYAEIARLCGAGFPLNQS 

GAAGTGCIGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATCGACCACrCGCTGTAC 
EVLASVGGSMIIGGIDHSLY 

ACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTATGAGGTGATCATTGTG 
TGSLWYTPIRREWYYEVI IV 

CGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAGTACAACTATGACAAG 
RVEINGQDLKMDCKEYNYDK 

AGCATTGTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCTG 
S IVDSGTTNLRLPKKVFEAA 

GTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGATGGTTTCTGGCTAGGA 
VKSIKAASSTEKFPDGFWLG 

GAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATTTTCCCAGTCATCTCA 
EQLVCWQAGTTPWNIFPVIS 

CTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACCATCCTTCCGCAGCAA 
LYLMGEVTNQSFRITILPQQ 

TACCTGCGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGTTACAAGTTTGCCATC 
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rioDRs a (2) 



YLRPVEDVATSQDDCYKFAI 

TCACAdltATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTC 
SQSSTGTVMGAVIMEGFYVV 

TTTGATCGGGCCCGAAAACGAATJ<3GCTTTGCTGTCAGCGCTTGCCATGTGCACGATGAG 
FDRARKRIGFAVSACHVHDE 

TTCAGGACGGCAGCGGTGGAAGGCCCTTTTGTCACCTTGGACATGGAAGACTGTGGCTAC 
FRTAAVEGPFVTLDMEDCGY 

AACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCCTATGTCATGGCTGCCATC 
NIPQTDESTLMTIAYVMAAI 

TGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGTGGCGCTGCCTCCGCTGC 
CALFMLPLCLMVCQWRCLRC 

CTGCGCCAGCAGCATGATGACTTTGCTGATGACATCTCCCTGCTGAAGTGAGGAGGCCCA 
LRQQHDDFADDISLLK 

TGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCACTTTGGTCACA^ 

GGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCCCCACCCACCAAATGC 

CTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAGGGACTGTACCTGTAG 

GAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACTCTTGGTCACCTCAAA 

TTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCCTGAACCTTTGTCCACCAT^ 

TTAAATTCTCCAACCCAAAGTATTCTTCTTTTCITAGTTTCAGAAGTAC^ 

CCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATTrGCTTTAGAGACAGG 
GACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCrCTTGAAAAAAAAAAAAA 
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FIGURE 3 (1) 



ATGGCCCAAGCCCTGCCCTGGCTCCTGCTGTGGATGGGCGCGGGAGTGCTGCCTGCCCAC 
MAQALPWLLLWMGAGVLPAH 

GGCACCCAGCACGGCATCCGGCTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGG 
GTQHGIRLPLRSGLGGAPLG 

CTGCGGCTGCCCCGGGAGACCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTT 
LRLPRETDEEPEEPGRRGSF 

GTGGAGATGGTGGACAACCTGAGGGGCAAGTCGGGGCAGGGCTACTACGTGGAGATGACC 
VEMVDNLRGKSGQGYYVEMT 

GTGGGCAGCCCCCCGCAGACGCTCAACATCCTGGTGGATACAGGCAGCAGTAACTTTGCA 
VGSPPQTLNILVDTGSSNFA 

GTGGGTGCTGCCCCCCACCCCTTCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 
VGAAPHPFLHRYYQRQLSST 

TACCGGGACCTCCGGAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAG 
YRDLRKGVYVPYTQGKWEGE 

CTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACATT 
LGTDLVSIPHGPNVTVRANI 

GCTGCCATCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGGGAAGGCA^ 
AAITESDKFFINGSNWEGIL 

GGGCTGGCCTATGCTGAGATTGCCAGGCOTGACGACTCCCTGGAGCCrriC'rriGACTCT 
GLAYAEIARPDDSLEPFFDS 

CTGGTAAAGCAGACCCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 
LVKQTHVPNLFSLQLCGAGF 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 
PLNQSEVLASVGGSMIIGGI 

GACCACTCGCTGTACACAGGCAGTCTCTGGTATACACCCATCCGGCGGGAGTGGTATTAT 
DHSLYTGSLWYTPIRREWYY 

GAGGTCATCATTGTGCGGGTGGAGATCAATGGACAGGATCTGAAAATGGACTGCAAGGAG 
EVIIVRVEINGQDLKMDCKE 

TACAACTATGACAAGAGCATTCTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAA 
YNYDKSIVDSGTTNLRLPKK 

GTGTTTGAAGCTGCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGAT 
VFEAAVKSIKAASSTEKFPD 
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FIGURE 3 (2) 



GGTTTCTGGCTAGGAGAGCAGCTGGTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATT 
GF'WLGEOLVCWQAGTTPVJNI 

TTCCCAGTCATCTCACTCTACCTAATGGGTGAGGTTACCAACCAGTCCTTCCGCATCACC 
FPVISLYL. MGEVTNQSFRIT 

ATCCTTCCGCAGCAATACCTGCGGCCAGTGGAAGATGTGGCCACGTCCGAAGACGACTGT 
ILPQQYLRPVEDVATSQDDC 

TACAAGTTTGCCATCTCACAGTCATCCACGGGCACTGTTATGGGAGCTGTTATCATGGAG 
YKFAISQSSTGTVMGAVIME 

GGCTTCTACGTTGTCTTTGATCGGGCCCGAARACGAATTGG L " l ' i - IX3 CTGTCAGCGCTTGC 
GFYVVFDRARKRIGFAVSAC 

CATGTGCACGATGAGTTCAGGACGGCAGCGGTGGAAGGCCCITTTGTCACCTTGGACATG 
HVHDEFRTAAVEGPFVTLDM 

GAAGACTGTGGCTACAACATTCCACAGACAGATGAGTCAACCCTCATGACCATAGCCTAT 
EDCGYNl PQTD.ESTLMTIAY 

GTCATGGCTGCCATCTGCGCCCTCTTCATGCTGCCACTCTGCCTCATGGTGTGTCAGT^ 
VMAAICALFMLPLCLMVCOW 

CGCTGCCTCCGCTGCCTGCGCCAGCAGCATGATGACTTTGCnXSATGACATCTCCCTGCTG 
RCLRCLRQQHDDFADDISLL 

AAGTGAGGAGGCCCATGGGCAGAAGATAGAGATTCCCCTGGACCACACCTCCGTGGTTCA 
K 

CTTTGGTCACAAGTAGGAGACACAGATGGCACCTGTGGCCAGAGCACCTCAGGACCCTCC 

CCACCCACCAAATGCCTCTGCCTTGATGGAGAAGGAAAAGGCTGGCAAGGTGGGTTCCAG 

GGACTGTACCTGTAGGAAACAGAAAAGAGAAGAAAGAAGCACTCTGCTGGCGGGAATACT 

CTTGGTGACCTCAAATTTAAGTCGGGAAATTCTGCTGCTTGAAACTTCAGCCC^ro 

TTGTCCACCATTCCTTTAAATTCTCCAACCCAAAGTATTCTTCTTTTCTTAGT^^ 

GTACTGGCATCACACGCAGGTTACCTTGGCGTGTGTCCCTGTGGTACCCTGGCAGAGAAG 

AGACCAAGCTTGTTTCCCTGCTGGCCAAAGTCAGTAGGAGAGGATGCACAGTTTGCTATT 

TGCTTTAGAGACAGGGACTGTATAAACAAGCCTAACATTGGTGCAAAGATTGCCTCTTGA 

ATTAAAAT^AAAAAAAAAAAAAAAAAAAAAA 
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riGDRS 4 



ATMCCCCAGCGCTGCACTGGCTCC'^GCTATGGGTTCGCTCGGGAArGC^CCTGCCCAG 

MAPALHWLLLWVGSGMLPAQ 

GGAACCCATCTCGGCATCCGGCTGCCCCTTCGCAGCGGCCTGGCAGGGCCACCCCTGCSGC 

G THLG I R LPLR SGIjAGP PLG 

CTGAGGCTGCCCCGGGAGACTGACGAGGAATCGGAGGAGCCTGGCCOGAGAGGCAGCTTT 

LRLPRETDEESEEPGRRGSF 

GTGGAGATCGTWSACAACCTCAGGGGAAAGTCCGGCCAaKKTACTATCTOCAGATGACC 

VEMVDNLRGKSGCGYYVE M 

GTAGGCAGCCCCCCACAGACGCTCAACATCC'I^SGTGGACACGGGCACTAGTAACTTTGCA 

VGSPPQTL.NILVDTGSSNFA 

GTGCKKXKriXXCCCACACCCTT?CCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACA 

VGAAPHPFLHRYYQRQLSST 

TATCGAGACCTCCGAAAGGGTGTGTATGTGCCCTACACCCAGGGCAAGTGGGAGGGGGAA 

YRDLRKGVYVPYTQGKWEGE 

CTGGGCACCGACCrrGGTGAGCATCCCTCATCGCCCCAACGTCAClGTGCG'^GCC^^ 

LGTDLVS I PHGPNVTVRANI 

GCTGCCATCACTGAATCGGACAAGTTCTTCATCAATCGTTCCAACTGGGAGGGCATCCT^ 

AAITESDKFFINGSKWEGIIj 

GGGCTGGCCTATGCTCACATTGCCAGGCCCGACGACTCTTTGGAGCCCTTC^^ 

GLAYAEIARPDDSLEPFFDS 

CTGGTGAAGCAGACCCACATTCCCAACATCTTTTCCCTGCAGCTCTGTGGCGCT^ 

LVKQTHI PNIFSLCLCGAGF 

CCCCrrCAACCAGACCGAGGCACTGGCCTCGGTGGGAGGGAGCATGATCATTGGTGGTATC 

PLNQTEALASVGGSMIIGGI 

GACCACTCGCTATACACXSGGCAGTCTCTGGTACACACCCATCCGGCaGGAGTGGTATTAT 

DKSLYTGSIjWYTI*! rrewyy 

GAAGTGATCATTCTaiCSTGTGGAAATCAATGGTCAAtyVTCTCAAGArrGGAC^^ 

EVI IVRVE INGQDLKMDCKE 

TACAACTACGACAAGAGCAlTCTrGGACAGTGGGACCACCAACCTTCGCTTGCCCAAGA^ 

YNYDKSIVDSGTTNLRLPKK 

GTATTTGWiGCTGCCGTCAAGTCCATCAAGGCAGCCTCCTCGACGGAGAAGTTCCCGGAT 

VFEAAVKS IKAASSTEKFPD 

GGCTTTIOTCTAGGGGAGCyVCCTGGTGTGCTGGCAAGCAGGCACGACCCCr^^ 

GFWLGEQLVCWQAGTTPWNI 

TTCCCAGTCATTTCACTTTACCTCATGGGTGAAGTCACCAATCAGTCCTTC 

FPVl SLYLMGEVTNQSFRIT 

ATCCTTCCTCAGCAATACCTACGGCCGGTGGAGGACGTGGCCACGTCCCAAGACGACTC3T 

ILPQQYLRPVEDVATSQDDC 

TACAAGTTCGCTCTCTCACAGTCATCCACGGGCACTGTTATGGGAGCCGTCATCATGGAA 

y K F AVS2SSTGTVM G A V I M E 

GFYVVFDRARKRIGFAVSAC 

CATGTCCACGATGACTTCAGGACGGCGGCAGTGGAAGGTCCG'rTTGTTACGGCAGACATG 

HVHDEFRTAAVEGPFVTADM 

GAAGACrnrTCGCTACAACATTCCCCAGACAGATCAGTCAACACTTATGACCATAGCCTAT 

EDCGYNIPQ T D E S T L M T I a Y 

gtcatggcggccatctgcgccc'i^'i'i'caixw'igccactcigcctcatggtatgtcagtog 

VMAAICALFMLPLCLHVCQW 

CGCTGCCTGCGTTGCCTGCGCCACCAGCACGATCACTrTGCrroATGACATCTCCCTGCTC 

RCLRCLRKQHDCFADDISLL 

AAGTAAGGAGGCTCGTGGGCAGATGATGGAGACGCCCCTGGACCACATCTGGGTGGrrCC 

K 

CTTTGGTCACAlXSAGTTGGAGCTATGGATGGTACC'roTGGCCAGAGCACCTCAGGACCCT 

CACCAACCTGCCAATGCTTCTGGCGrraACAGAACAGAfiAAATCAGGCAAGCIGGAT^ 

GGGCTTGCACCTGTAGGACACAGGAGAGGGAAC3GAAGCAGCGTTCTGGTGGCAGGAATAT 

CCTTAGGCACCACAAACTTGAGTTGGAAATTrroCTGCTTGAAGCTTCAGCCC^ 

CTGCCCAGCATCCTTTAGAGTCTCCAACCTAAAGTATTCTTTATGTCCTTCCAGAAGTAC 

TGGCGTCATACTCAGGCTACCCGGCATGTGTCCCTGTCGTACCCTGGCAGASAAAGGGCC 

AATCTCATTCCCTGCTGGCCAAAGTCAGCAGAAGAAGGTGAAGTTTGCCAGTTC 

TCATAGGGACTGCAGACTCAAGCCTACACTGGTACAAAGACTSCGTCTTQAGATAAACAA 

GAA 
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1 -MAOALPVfLLLWMGAGVLPAHGTQHGIRLPLRSGLGGAPLGLRLPRETDEE 50 

II li IIIIM-MII II IIIIIIIMI I Mlllllllllll 
1 MAPAUiWLtiWVGSGMLPAQGTHLGIRLPLRSGLAGPPLGLRLPRETDEE 50 

51 PEEPGRRGSFVEMVnmJlGKSGQGYYVEKTVGSPPQTLNILVDTGSSNFA 100 

IMIIMIIIIMIIIIIIIIIIMIIMIIillllllllllllllMI 

51 SEEPGRRGSFVEMVimRGKSGQGYWEMTVGSPPQTLJIILVDTGSSNFA 100 

101 VGAAPHPFLHRYYQRQLSSTVRDLRKGVWPYTOGKWEGELGTDLVSIPH 150 

tllMtlllMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 
101 VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 150 

151 GPNVTVRANIAAITESDKFFINGSNWEGILGLAyAEIARPDDSLEPFPDS 200 

llllllllllllllllllllllllllllillMlilllllllllllilll 

151 GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 200 
201 LVKQTHVPNLFSLQLCGAGFPUJQSEVIASVGGSMIIGGIDHSLYTGSLW 250 

MlllhlhlllllMIIIIIII-i lllllllllllllllllllllll 

201 LVKQTHIPNIFSLQLCGAGFPLNQTEALASVGGSMIIGGIDHSLVTGSLW 250 

251 YTPIRREVJYYEVIIWVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 300 

lllilllllMIIIIMIIIIIilllllllllllltlllllltllllMI 
251 YTPIRREWYYEVIIVRVEINGQDLKMDCKEYNYDKSIVDSGTTNLRLPKK 300 

301 VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMG 350 

llllllllllllilllllllllllllllllllllllllllllllilllll 
301 VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFFVISLYLMG 350 

351 EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 400 

IIIIIIIIIMIIIIIIIIIIIillMilllllhllllllllllliMI 

351 EVTNQSFRITILPQQYIiRFVEDVATSQDDCYKFAVSQSSTGTVMGAVIME 400 

401 GFYWFDRARKRIGPAVSACHVHDEFRTAAVEGPFVTLDMEDCGyNIPQT 450 

llllllllllinilltllllllllllllllllllll llllilllltil 
401 GFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTADMEDCGYNIPQT 450 

451 DESTI^IAYVMAAICALFHLPIiCLMVCQWRCLRCLRQQHDDFADDISLL 500 

llllllllltMIIIIIIIIIIIIIIIIMIIIIIII llllllllllll 
451 DESTLMTIAYVMAAICALFMLPI^IJIVCQWRCLRCLJ^HQHDDPADDISLL 500 

501 K 501 

1 

501 K 501 
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riOCTRB 6 (1) 

ATCKCTAGCATGACTGGTGGACAGCAAATt3GGTCGCGGATCCAC=CAGCACGGCATCCGG 
MASMTSGQQKGRGS T Q H C I R 

CTGCCCCTGCGCAGCGGCCTGGGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACC 
LPLRSGLGGAPLGLHLPRET 

GACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCTTTGTGGAGATGGTCGACAACCTG 
DEEPEEPGRRGSFVEMVDNL 

AGGGGCAAGTCGGGGCAGGGCTACTACGTOGAGATGACCGTGGGCAGCCCCCCGCAGACG 
RGKSGQGYYVEMTVGSPPQT 

CTCAACATCCTGGT3GATACAGGCAGCAGTAACTTTGCAGTGGGTGCTGCCCCCCACCCC 
LN I LVDTGSSN^FAVGAAPHP 

TrCCTGCATCGCTACTACCAGAGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGC 
FLHRYYaRCt-SSTYRDLRKG 

GTGTATGTGCCCTACACCCAGGGCAAGTGGGAAGGGGAGCTGGGCACCGACCTGGTAAGC 
VYVPYTQGKWEGELGTDLVS 

ATCCrcCATGGCCCCAACGTCACTCTGCGTGCCAACATTGCTGCCATCACTGAATC^ 
IPHGPKVTVRAKTAAITESD 

AAGTTCTTCATCAACGGCTCCAACTGGGAAGGCATCCTKXSGGCTGGCCT^ 
KFFI NGSNWEGI LGLAYAEI 

GCCAGGCCTGACGACTCCCTGGAGCCTTTCTTTGAC' 
ARPDDSLEPFFD 

CCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTCCCCCTCAACCAGTC^ 
PNLFSLQLCGAGFPLNOSEV 

CTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATCGACCACTCC 
LASVGGSMI I6GIDHSLYTG 

AGTCTCTCGTATACACCCATCCGGCGGGAGTGGTATTATGAGGTCATCATTGTG 
SLWYTPrRREVJYYEVI IVRV 

GAGATCAATGGACAGGATCTGAAAATCGACTGCAAGGAGTACAACTATGACAAGAGCATT 
EINGQDLKMDCKEYNYDKS 1 

GTGGACAGTGGCACCACCAACCTTCGTTTGCCCAAGAAAGTGTTTGAAGCT^ 
VDSGTTNLRLPKKVFEAAVK 

TCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCCTGATGGTTrCTGGCTAGGAGAGCAG 
SIKAASSTEKFPD3FWLGEQ 

CTOSTGTGCTGGCAAGCAGGCACCACCCCTTGGAACATTTTCCCACrCATCTCACTCTAC 
LVCWQAGTTPWNIFPVI SLY 

LMGEVTNCSFRIT ILPQQYL 
CGGCCAGTGGAAGATGTGGCCACGTCCCAAGACGACTGTTACAAGTTTGCCATCTCACAG 
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PIOORE £ (3) 



RPVEDVATSQDDCYKFAI SQ 

TCATCCACGGGCACTCrTTATGGGAGCTGTTATCATGGAGGGCTTCTACGTTGTCTTTGA 
SSTGTVKGAVIHEGFYVVFD 

CGGGCCCGAAAAMAATTGGCTTTGCTtna«;CGCTTGCC^ 

RARKRIG-FAVSACHVHDEFR 

ACGGCAGCGGTCGAAGGCCCTITIXTrCACClTCGACATGGAAGACTGTG^ 
TAAVEGPFVTLDMEDCGYKI 

CCACAGACAGATGAGTCATGA 
P C T D E S • 
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noma 7 (i) 

AlXKCTAGCATGACTGGTX3GACAGCAAATG(3C?rC(KOGATCGATt^ 
■ MA SKTGGQ5MGRG S M T I S PS 

CCGCTrrGAACAGGACGGATCCACCCAGCACGGCATCOMCTGCCCCTGCGCAGCGGCCTG 
P R E Q D G S TQHGIRLPLRSGL. 

GGGGGCGCCCCCCTGGGGCTGCGGCTGCCCCGGGAGACCGACCAAGAGCCCGAGGAGCCC 
GCAPLGLRLPRETDEEPEEP 

GGCCGGAGGGGCAGCTTTGTXXSAGATGGTGGACAACCTCAGGGGCAAGTCGGGGCAGGGC 
GRRGSFVEMVDNLRGKSGQG 

TACTACGTGGAGATGACCGTGGGCAGCCCCCCGCAGACGCIXJACATCCTGGTGGATAC 
YYVEMTVGSPPOTLNI LVDT 

GGCAGCAGTAACTTTCCACTGCGTGCTGCCCCCCACCCCTTCCTCCATCGCTACTACCAG 
GSSKTAVGAAPHPFLHRYVQ 

AGGCAGCTGTCCAGCACATACCGGGACCTCCGGAAGGGCGTGTATGTGCCCTACACCCAG 
RQLSSTYRDLRKGVYVpyTQ 

GGCAAGTGGGAAGCS5GAGCTGGGCACCGACCTGGTAAGCATCCCCCATGGCCCCAACGTC 
GKWEGELGTDLVSIPHGPNV 

ACTGTCCGTGCCAACATTGCTGCCATCy^CTaAATCAGACAAGTTCTTC 
TVRANIAAIT5SDKFFINGS 

AACTGGGAAGCCATCCTGGGGCTGGCCTATGCTGAGATrCCCAGGCCrrGAC^ 
NWEGILGLAYAEIARPDCSL 

GAGCXTTTTCOTTGACTCTCTGGTAAAGCAGACCCACGTTCCCAACCTCT^^ 
EPFFDSLVKQTHVPHLFSLQ 

CrrrGTGGTGCTGGCTTCCCCCTCAACCAGTCTGAAG^ 

LCGAGFPLNQSEVLASV GGS 

ATGATCATrGGAGGTATCGACCACTCGCTGTACACACMCAGTCTCTQGTATACACCCATC 
MI IGGIDHSLYTGSLWYTPI 

CGGaSGGAGTGGTATTATGAOGTCATCATTGTGCGGGTGGAGATCAATTOACAGGAT^ 
RREWYYEVI IVRVEIHGQDL 

AAAATGGACTGCAAGGAGTACAACrrATGACAAGAGCATTGTGGACAGTGGCACCACCAAC 
KMDCKEYNYDKS IVDSGTTN 

CTTCGTTTGCCCAAGAAAGTGTTTGAAGCIXXIAGTCAAATCCATCA^ 
LRLPKKVFEAAVKSIKAASS 

TEKFPDGFWLGEQLVCWQAG 

ACCACCCCTTGGAACATTTrCCCAGTCATCTCACTCTACCTAATOGGTGA 
TTPWNIFPVI SLYLMGEVTN 
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CAGTCCTTCCGCATCACCATCCTTCCGCAGCAATACCTGCGGCCAGTOSAASATaTSGCC 
OSFRITILPQOYLRPVEDVA 



ACGTCCCAAGACGACICTTACAAGTTTCCCATCTCACAGTCATCCACGGGCACTG 
TSQDDCYKFAISCSSTGTVM 



GGAGCT3TTATCATGGAGGGCTrCTA( 
GAVIMECFY 



TTTGCTCrrCAC^GCTroCCATITreCACCATGAGTTCAGGACGGCAGCGGTGGAA^ 
FAVSACHVHDEFRTAAVEGP 



TTTCTCACCTIGGACATGGAAGACTCTGGCTACAACATTCCACAGACASATGAGTCAT^ 
FVTLDMEDCGYHIFQTDES* 
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PIGORE e (1) 



ATMCTCAGCATCOTATTCtnCTGCCACTGCGTAGCCSTCTGGGTCKTGC^ 
MT-QHGTRLPLRSGLGGAPLG 

CTGCGTCTt3CCCC(SCX3ACy^CCGACGAAGAGCCCGAGGAGCCCGGCCGGAGGGGCAGCrrT 
LFtLPRETDEEPEEPGRRGSF 

GTGGAGATXXnx;GACAACCTX3AGGGGCAAGTCGGGGCAGGGCTACTACGTGGA^ 
VEMVDKLRGKSGCGYYVEMT 

GTGGGCACK:CCCCCGCAGACGCTCJU^CATCCT3GTCGATACAGGCAGCAGTAACTT1^A 

vgsppqtlnii.vdtgssnfa 

grnxknxjctgccccccaccccitcctgcatcgctacrraccagaggcagctgtc 
vgaapkpplhryyqrqlsst 

taccgggacctcccxsaagcgcgtctatgtgccctacacccagggcaagtgggaaggggag 

YRDLRKGVYVPYTOGKWEGE 

CTGGGCACCGACCIXMTAAGCATCCCCCATGGCCCCAACGTCACTGTGCGTGCCAACAT^ 
LGTCLVSIP«GPNVTVRANI 

GCTGCO^TCACTGAATCAGACAAGTTCTTCATCAACGGCTCCAACTGOGARGGCATCCTC 
AAITESDKFFINGSKWEGIL 

CGGCTGGCCTATGCTGAGATTGCCAGGCCTGACGACTCCCTCGAGC C ' i ' n ' ^ 
GLAYAEIARPDDSLEPrFDS 

CTGGTAAAGCAGArcCACGTTCCCAACCTCTTCTCCCTGCAGCTTTGTGGTGCTGGCTTC 
LVKQTHVPHLFSLQLCGAGF 

CCCCTCAACCAGTCTGAAGTGCTGGCCTCTGTCGGAGGGAGCATGATCATTGGAGGTATC 
PLNQSEVLASVGGSMIIGGI 

GACCACTCGCTGTACACACGCAGTCTCTGGTATACACCCATCCGGCGGGAGT^ 
DHSLYTGSLWYTPIRREWYY 

GAGGTCATCATTCTGCGOGTGGAGATCAATGGACAGGATCTGAAAATGGAC^^ 
EVI IVRVEINGQDLKMDCKE 

TACAACTATGACAAGMCATTSTGGACAGTGGCACCACCAACCTTCGT^ 
YNYDKSIVDSGTTNLRLPKK 

GTGTTTGAAGCTCCAGTCAAATCCATCAAGGCAGCCTCCTCCACGGAGAAGTTCCC^ 
VFEAAVKSIKAASSTEKFPD 

GGTTTCTCSGCTAGGAGAGCAGCTGGTCTGCTGGCAAGCAGGCACCACCCCTTG^^ 
GFWLGEQLVCWQAGTTPWBI 

TTCCCAGTCATCTCACTCrACCTAATGGGTGAGGTTACCAACCAGTCCT^ 
FPVISLYLMGEVTNQSFRIT 

ATCCTTCCCCAGCAATACCTGCGGCCAGTGGAAGATCieGCCACGTCCCAACA^ 
ILPQCYLRPVEDVATSQDDC 
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FIGDRS e (2) 



TACAftCnTTtKCATCTCACAGTCATCCACCGGCACTGlTATGCMACCT^^ 
VKFAlSQSSrCTVMGAVIHE 

GGCTTCTACCnTCT C ' l ' l ' lXJ ATCCMGCCCGAAAACGAATTGGrri'lXJCTGTC^ 
GFYVVFDRARKRIGFAVSAC 



CATTAG 
H ♦ 
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FIGURE 9 



IP: Ab 369 
IP: AbCB 
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FICOItB 10 



Asp 



- + 




<-CTF9 



CO 

a. 

a. 
< 



ci. 
o. 
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FIGOSS 11 



t^ftiC^ft^P WL LI 'WMGAGVLPAHG TOHGIRLPmSGLGGAPLgLRLPRETDES 
PEEPGW^GSFVEMVDm^RGKSGOGYYVEtrrVGSPPQTLNILVDTXSSSNFA 
VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVFYTQGKWEGELGTDLVSIPH 
GPNVTVRANIAAITESDKFFINGSNWEGILGLAYAEIARPDDSLEPFFDS 
LVKQTH'/PNLFSI.OLCGAGFPLNQSEVLASVGGSMIIGGIDHSLYTGSLW 
YTPIRREWYYE:VIIVRVEINGQDUaiIX;KEYNYDKSIVDSGTTNLRLPKK 
VFEAAVKSIKAASSTEKFPDGFWLGEQLVa-fQAGTTPWNIFPVISLYLMG 
EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 
GFYWFDRARKRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 
DES 
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FIOURB 12 



f^ftOftTiPWL^'T- WMGAGVLPAHG TOHGIRLPLRSGLGGAPLGLRLPRETDEE 

PEEPGIUlGSFVEMVDMI^GKSGOGYYVEtm^GSPPffn-NILVIXrGSSNF 

VGAAPHPFLHRYYQRQLSSTYRDLRKGVYVPYTQGKWEGELGTDLVSIPH 

GPNVTVRANIAAITESDKFFINGSNWEGIIiGLAYAEIARPDDSLEPFFDS 

LVKQTHVPNLFSLQLCGAGFPUIQSEVIASVGGSMI IGG IDHSLYTGSLW 

YTPIRREWYYE^aiVRVEINGQDUCMDCKEYKYDKSIVDSGTTNLRLPKK 

VFEAAVKSIKAASSTEKFPDGFWLGEQLVCWQAGTTPWNIFPVISLYLMG 

EVTNQSFRITILPQQYLRPVEDVATSQDDCYKFAISQSSTGTVMGAVIME 

GFYVVFDRAIUCRIGFAVSACHVHDEFRTAAVEGPFVTLDMEDCGYNIPQT 

OESHHHHHH 
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SEQUENCE LISTING 

<110> Gumey, Mark E. 

Bienkowski , Michael J . 
Heinrikson, Robert L. 
Parodi, Luis A. 
Van. Riqiang 

Phazmacia & Upjohn Company 

<120> Alzheimer's Disease Secretase 

<130> 6177. P CP 

<140> 
<141> 

<150> 60/101,594 
<151> 1998-09-24 

<160> 49 

<17C> Patentin Ver. 2.0 

<210> 1 
<211> 1804 
<212> DNA 

<213> Homo sapiens 
<400> 1 

atgggcgcac tggcccgggc gctgctgctg cctctgctgg cccagtggct cctgcgcgcc 60 
gccccggagc tggcccc'cgc gcccttcacg ctgcccctcc gggtggccgc ggccacgaac 120 
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cgcgtagttg cgcccacccc gggacccggg 
gcgctcgccc r^gagcctgc cctggcgtcc 
gtagacaacc tgcaggggga ctctggccgc 
cccccgcaga agctacagac tctcgtcgac 
accccgcact cccacataga cacgtacttt 
aagggcuttg acgtcacagc gaagtacaca 
gacctcgtca ccatccccaa aggctccaat 
ctcgaatcag agaatttctt tttgcctggg 
tatgccacac tcgccaagcc atcaagt-ct 
caagcaaaca tccccaacgt tttctccatg 
ggatctggga ccaacggagg tagtcttgtc 
ggagacatct ggtatacccc tattaaggaa 
ttggaaaccg gaggccaaag ccctaatctg 
atcgtggaca gtggcaccac gctgctgcgc 
gaagctgtgg cccgcgcatc tctgattcca 
cagctggcgt gctggacgaa Ctcggaaaca 
tacctgagag atgagaactc cagcaggtca 
attcagccca tga-gggggc cggcctgaat 
tccacaaatg cgctggtgat cggtgccacg 
agagccsaga agagggtggg cttcgcagcg 
gtgtctgaaa tctccgggcc tttctcaaca 
cagtctttga gcgagcccat tttgtggact 
gccaccctcc ttgccttaat cgtcctgcLg 
cgtgaceccg aggtcgtcaa tgatgagtcc 
gccaggcccg acctcaagca accatgaact, 
agcagccggg atcgatggtg gcgctttctc 
gcccccagat gccttctaga -tcactgtct 
ctccctacrt ccaagaaaaa raattaaaaa 

<210> 2 



acccccgccg agcgccacgc cgacggctcg 180 
cccgcgggcg ccgccaactt cttggccatg 240 
ggctaccacc tggagatgct gatcgggacc 300 
actggaagca gtaactttgc cgtggcagga 360 
gacacagaga ggtctagcac ataccgctcc 420 
caaggaagct ggacgggctt cgttggggaa 480 
acttcttLtc tcgtcaacat tgccactact 540 
attaaatgga atggaatacc cgccccagct, 600 
ctggagacct tcttcgactc cctggtgaca 660 
cagatgtgtg gagccggctt gcccgttgcL 720 
ttgggcggaa ttgaaccaag ttrgcataaa 730 
gagtggtact accagataga aactccgaaa 840 
gactgcagag agtataacgc agacaaggcc 900 
ctgccccaga aggtgtttga tgcggtggtg 960 
gaattctcCff atggtttctg gactgggccc 1020 
ccttggtctt acttccctaa aatctccatc 1080 
ttccgtatca caatcctgcc tcagctttac 1140 
tatgaatgtt accgattcgg catttcccca 1200 
gcgatggagg gcttctacgt catcttcgac 1250 
agcccctgtg cagaaattgc aggtgctgca 1320 
gaggatgtag ccagcaactg !:gtccccgct 1380 
gcfftcctatg cgctcatgag cgcctgtgga 144C 
ctgctgccgt tccggcgcca gcgtcgcccc 1500 
tctctggtca gacatcgctg gaaatgaata 1560 
cagctattaa gaaaatcaea cctccagggc 1620 
ctgtgcccac ccgtctccaa tctctgttci 1680 
tttgattctt gattttcaag ctctcaaatc 1740 
aaaaacctca ttctaaacca aaziaaaaaaa 1800 

1804 
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<211> 518 

<212> PRT . 

<213> HoiLo sapiens 

<400> 2 

Met Gly Ala Leu Ala Arg Ala Leu Leu Leu Pro Leu Leu Ala Gin Trp 
1 5 10 15 

Leu Leu Arg Ala Ala Pro Glu Leu Ala Pro Ala Pro Phe Thr Leu Pro 
20 25 30 

Leu Arg Val Ala Ala Ala Thr Asn Arg Val Val Ala Pro Thr Pre Gly 
35 4C 45 

Pro Gly Thr Pro Ala Glu Arg His Ala Asp Gly Leu Ala Leu Ala Leu 
50 55 60 

Glu Pro Ala Leu Ala Ser Pro Ala Gly Ala Ala Asn Phe Leu Ala Met 
6B 70 75 80 

Val Asp Asn I.eu Gin Gly Asp Ser Gly ArgJW-y Tyr Tyr Leu Glu Met 
85 90 95 

Leu He Gly Thr Pro Pro Gin Lys Leu Gin He Leu Val Asp Thr Gly 
100 105 110 

Ser Ser Asn Phe Ala Val Ala Gly Thr Pro His Ser Tyr He Asp Thr 
115 120 125 

Tyr Phe Asp Thr Glu Arg Ser Ser Thr Tyr Arg Ser Lys Gly Phe Asp 
130 ~ 135 140 



3 



PCT/US99/20881 



Val Thr Val -Lys Tyr Thr Gin Gly Ser Trp Thr Gly Phe Val Gly Glu 
145 150 155 160 



Asp Leu Val Thr lie Pro Lys Gly Phe Asn Thr Ser Phe Leu Val Asn 
165 170 175 



lie Ala Thr lie Phe Glu Scr Glu Asn Phe Phe Leu pre Gly lie Lys 
ISO IBS 190 



Tip Asn Gly He Leu Gly Leu Ala Tyr Ala Thr Leu Ala Lys Pro Ser 
195 200 205 



Ser Ser Leu Glu Thr Phe Phe Asp Ser Leu Val Thr Gin Ala Asn He 
210 215 220 



Pro Asn Val Phe Ser Met Gin Met Cys Gly Ala Gly Leu Pro Val Ala 
225 230 235 240 



Gly Ser Gly Thr Asn Gly Gly Ser Leu Val Leu Gly Gly He Glu Pro 
245 250 255 



Ser Leu Tyr Lys Gly Asp He Trp Tyr Thr Pro He Lys Glu Glu Trp 
260 265 270 



Tyr Tyr Gin He Glu He Leu Lys Leu Glu He Gly Giy Gin Ser Leu 
275 280 285 



Asn Leu Asp Cys Arg Glu Tyr Asn Ala Asp Lys Ala He Val Asp Ser 
290 295 300 
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Gly Thr Hir Leu Leu Arg Leu Pro Gin Lys Val Phe Asp Ala Val Vai 
305 ' 310 315 320 

Glu Ala Val Ala Arg Ala Ser Leu lie Pro Glu Phe Ser Asp Gly Phe 
325 330 335 

Trp Thr Gly Ser Gin Leu Ala Cys Trp Thr Asn Ser Glu Thr Pro Trp 
340 345 350 

Ser Tyr Phe Pro Lys He Ser lie Tyr Leu Arg Asp Glu Asn Ser Ser 
355 360 365 

Arg ser Phe Arg He Thr He Leu Pro Gin Leu Tyr He Gin Pro Met 
370 375 380 

Met Gly Ala Gly Leu Asn Tyr Glu Cys Tyr Arg Phe Gly He Ser Pro 
385 390 395 400 

Ser Thr Asn Ala Leu val He Gly Ala Thr Val Met Glu Gly Phe Tyr 
405 41C 415 

Val He Phe Asp Arg Ala Gin Lys Arg Val Gly Phe Ala Ala Ser Pro 
420 425 430 

Cys Ala Glu He Ala Gly Ala Ala Val Ser Glu He Ser Gly Pro Phe 
435 440 445 

Ser Thr Glu Asp Val Ala Ser Asn cys Val Pro Ala Gin ser Leu ser 
450 455 460 

Glu Pro He Leu Trp He Val Ser T^t Ala Leu Met Ser Val Cys Gly 
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465 470 475 480 



Ala He lieu Leu Val Leu He Val Leu Leu Leu Leu Pro Phe Axg Cys 
485 490 495 



Gin Arg Arg Pro Arg Asp Pro Glu Val Val Asn Asp Glu Ser Ser Leu 
500 505 510 



Val Arg His Arg Trp Lys 
SIS 



<210> 3 

<211> 2070 

<212> DNA 

<213> Home sapiens 



<400> 3 



atggcccaag 


ccctgccctg 


gcccctgctg 


tggatgggcg 


cgggagtgst 


gcctgcccac 


60 


ggcacccagc 


acggcatccg 


gctgcccctg 


cgcagcggcc 


Cggggggcgc 


ccccctgggg 


X20 


ctgcggcCgc 


cccgggagac 


cgacgaagag 


cccgaggagc 


ccggccggag 


gggcagcctc 


lao 


gtggagatgg 


tggacaacct 


gaggggcaag 


ccggggcagg 


gccactacgt 


ggagatgacc 


240 


gcgggcagcc 


ccccgcagac 


gctcaacatc 


ctggtggaca 


caggcagcag 


taacttcgca 


300 


gtgggtgctg 


ccccccaccc 


cttcctgcat 


cgctactacc 


agaggcagct 


gtccagcaca 


360 


taccgggacc 


tccggaaggg 


tgtgtacgtg 


ccctacaccc 


agggcaagtg 


ggaaggggag 


420 


ctgggcaccg 


acctggtaag 


catcccccat 


ggccccaacg 


tcactgtgcg 


tgccaacatt 


480 


gctgccatca 


ctgaatcaga 


caagttctcc 


atcaacggct 


ccaaccggga 


aggcatcctg 


540 


gggctggcct 


a t get gaga t 


tgccaggcct 


gacgactccc 


tggagccttr 


ctttgactct 


600 


ctggtaaagc 


agacccacgt 


tcccaacctc 


ttcuccccgc 


acccttgtgg 


tgccggcttc 


660 


cccctcaacc 


agtctgaagt 


gctggcccct 


gtcggaggga 


gcatgatcat 


tggaggtatc 


720 


qaccactcgc 


tgtacacagg 


cagtctcCgg 


Catacaccca 


tccggcggga gtggtaCtat 


780 
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gaggtcatca 


tcgcgcgggr ggagatcaat 


ggacaggatc 


rgaoaacgga 


ccgcaaggag 


840 


tacaactatg 


a'caagagcat tgtggacagt 


ggcaccacca 


accttcgttt 


gcccaagaaa 


900 


gtgtttgaag 


ccgcagtcaa atccatcaag 


gcagcctcct 


ccacggagaa 


gtccccLgaL 


960 


ggtttctggc 


waggagagca gctggcgtgc 


tggcaagcag gcaccacccc 


ctggaacatt 


1020 


ttcccagtca 


-CLcacccta cccaatgggt 


gaggttacca 


accagtcctt 


ccgcatcacc 


1080 


atccttccgc 


agcaacacct gcggccagtg gaagatgtgg 


ccacgtccca 


agacgactgt 


1140 


cacaagccLg 


ccaccccaca gtcacccacg 


ggcactgtta 


tgggagctgt 


tatcatggag 


1200 


ggcttctacg 


Ctgtctctga tcgggcccga aaacgaattg gcttrgctgt 


cagcgcttgc 


1260 


cacgtgcacg 


atgagtncag gacggcagcg 


gtggaaggcc 


cttt-gtcac 


cttggacatg 


1320 


gaagac^gtg 


gctacaacat tccacagaca 


gatgagtcaa 


ccctcacgac 


catagcc'at 


13B0 


gtcatggctg 


ccatctgcgc cctcttcatg 


ctgccastcr 


gcctcatggt 


gtgccagtgg 


1440 


cgctgcctcc 


gctgcctgcg ccagcagcac 


gatgactttg 


:;tgatgacat 


ctccctgctg 


ISOO 


aagtgaggag 


gcccatgggc agaagataga gattcccctg gaccacaccc 


ccgtggttca 


1560 


ctttggccac 


aagtaggaga cacagatggc 


acctgtggcc 


agagcacctc 


aggaccctcc 


1620 


ccacccacca 


aatgcctctg ccttgatgga 


gaaggaaaag 


gccggcaagg 


tgggctccag 


1680 


ggactgcacc 


tgtaggaaac agaaaagaga 


agaaagaagc 


actctgctgg 


cgggutact 


1740 


cttgg^cacc 


Lcaaatttaa gtcgggaaat 


tctgctgctt 


gaaacttcag 


ccctgaacct 


1800 


ttgtccacca 


ttcctttaaa ttctccaacc 


caaagtattc 


ttCLtttCtt 


agtttcagaa 


1860 


gtactggcat 


cacacgcagg ttaccttggc gtgtgtcccc gtggtaccct 


ggcagagaag 


1920 


agaccaagcr 


tgtccccccg ctggccaaag 


tcagtaggag 


aggatgcaca 


gtttgctatt 


1980 


tgctttagag 


acagggactg tataaacaag 


cctaacattg gtgcaaagat 


tgcctcttga 2040 


attaaaaaaa 


aaaaaaaaaa aaaaaaaaaa 








207C 



<210> 4 

<211> 501 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Ala Gin Ala Leu Pro Trp L«u Leu Leu Trp Met Gly Ala Gly Val 

15 10 15 
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Leu Pro Ala' His Gly Thr Oln His Gly lie Arg Leu Pro heu Arg Ser 
20 25 30 

Sly Leu cly Gly Ala Pro Leu Gly Leu Arg Leu pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Her Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn He Leu Val Asp Tiir Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser He Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 150 155 160 



Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
163 170 175 
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Glu Gly Hg Lqu Gly Leu Ala lyr Ala Glu lie Ala Arg Pro Asp Asp 
-180 185 190 



Ser Leu Glu Pro ?he Phe Asp Ser Leu Val Lys Gin Thr His Val Pre 
195 200 20b 



Asn Leu Phe Ser Z,eu His Leu cys Gly Ala" Gly Phe Pro Leu Asn Gin 
210 215 220 



Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He lie Gly Gly lie 
225 230 235 240 



Asp His Ser Leu Tyr Thr Gly Ser Leu Tip Tyr Thr Pro He Arg Arg 
245 ' 250 255 



Glu Tip Tyr Tyr Glu Val He He Val Arg Val Glu He Asn Gly Gin 
260 265 270 



Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 
275 230 285 



Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys val Phe Glu Ala 
290 295 300 



Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 
306 310 315 320 



Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 



Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
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Thr Asn Gin Ser Phe Arg lie Thr lie Leu Pro Gin Gin Tyr Leu Arg 
355 360 365 



Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr hys Phe Ala 
370 375 380 



He Ser Gin ser Ser Thr Gly Thr Val Met Gly Ala Val lie Met Glu 
385 390 39S 400 



Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 



Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 



Gly Pro Phe Val Thr Leu Asp Met Glu Asp cys Gly Tyr Asn He Pro 
435 440 445 



Gin Thr Asp Glu Ser Thr Leu Met Thr He Ala Tyr Val Met Ala Ala 
4S0 4SS 460 



lie cys Ala Leu Phe Met Leu Pro Leu cys Leu Met Val cys Gin Trp 
465 470 475 480 



Arg Cys Leu Arg Cys Leu Arg Gin Gin His Asp Asp Phe Ala Asp Asp 
4S5 490 495 



He Ser Leu Leu Lys 
500 
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<210> 5 

<211> 1977 

<212> DNA 

<213> Homo sapiens 

<400> 5 

aCggcccaag ccctgccctg 
ggcacccagc acggcatccg 
ctgcggctgc cccgggagac 
gcggagatgg tggacaacct 
gtgggcagcc ccccgcagac 
gcgggtgctg ccccccaccc 
taccgggacc tccggaaggg 
ctgggcaccg acczggtaag 
gctgccatca ctgaatcaga 
gggctggcc- atgccgagat 
gaagtgccgg cctctgtcgg 
acaggcagtc tctggtatac 
cggg^ggaga tcaatggaca 
agcattgtgg acagtggcac 
gtcaaatcca tcaaggcagc 
gagcagctgg tgtgctggca 
ctctacctaa tgggtgaggt 
cacctgcggc cagcggaaga 
tcacagtcat ccacgggcac 
tttgatcggg cccgaaaacg 
ttcaggacgg cagcggtgga 
aacattccac agacagacga 
tgcgcccrct ccatgctgcc 



gctcctgctg -ggatgggcg cgggagtgct gcctgcccac 60 
gctgeccctg cgcagcggcc tggggggcgc ccccctgggg 120 
cgacgaagag cccgaggagc ccggccggag gggcagcttt 180 
gaggggcaag ccggggcagg gccactacgt ggagatgacc 240 
gcccaacatc ctggiggata caggcagcag taactttgca 300 
cctcctgcat cgctactacc agaggcagct gcccagcaca 360 
Cgtgtatgtg ccctacaccc agggcaagtg ggaaggggag C2Q 
catcccczat ggccccaacg tcactgCgcg tgccaacatt 480 
caagctcttc atcaacggct ccaactggga aggcatcctg 540 
tgccaggctc tgtggtgccg gctcccccct caaccagtct 600 
agggagcatg atcattggag gcatcgacca ctcgccgtac 660 
acccatccgg cgggagtggt attatgaggt garcattgcg 720 
ggatctgaaa atggactgca aggagtacaa ctatgacaag 780 
caccaacctc cgtttgccca agaaagtgtt tgaagctgca B40 
ctccrccacg gagaagttcc ctgatggttt ctggctagga 900 
agcaggcacc accccttgga acattttccc agtcaCctca 960 
taccaaccag tccttccgca tcaccatcct tccgcagcaa 1020 
cgtggccacg tcccaagacg actgttacaa gtttgccatc 1080 
cgttatggga gctgtcatca tggagggcct ctacgctgtc 1140 
aattggctit gctgtcagcg cttgccatgt gcacgatgag 1200 
aggccctt^t gccaccttgg acacggaaga ctgtggctac 1260 
gtcaaccctc acgaccacag cctatgtcac ggctgccatc 1320 
actctgcctc atggcgtgtc agtggcgctg cc-ccgctgc 1380 
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ctgcgccacc 


agcatgatga ctttgccgat 


gacatctccc 


tgctgaagtg 


aggaggccca 


1440 


tgggcagaag 


aitagagattc 


ccctggacca 


cacccccg-g 


gttcactttg 


gtcacaagta 


15O0 


ggagacacag 


aCggcacctg 


tggccagagc 


acctcaggac 


cccccccacc 


caccaaatgc 


1560 


ctctgccttg 


atggagaagg 


aaaaggctgg 


caaggtgggt 


tccagggact 


gtacctgtac 


1620 


gaaacagaaa 


agagaagaaa 


gaagcactct 


gctggcggga 


a-artcctgg 


tcacctcaaa 


1660 


tttaagtcgg 


gaaattctgc 


tgcttgaaac 


ttcagecctg 


aacctttgtc 


caccattcct 


1740 


ttaaattccc 


caacccaaag 


tattcttctt 


ttcttagrtt 


cagaagtact 


ggcatcacac 


1800 


gcaggttacc 


ttggcgtgtg 


tccctgtggt 


accctggcag 


agaagagacc 


aagcttgttt 


1860 


ccctgctggc 


ca£iagtcagt 


aggagaggat 


gcacagtttg 


ctatttgctt 


tagagacagg 


1920 


gactgtataa 


acaagcctaa 


caCtggtgca 


aagatcgcct 


cttgaaaaaa 


aaaaaaa 


1977 



<21Q> 6 

<211> 476 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Gly Ala Gly Val 
15 10 15 

Leu Pro Ala His Gly Thr Gin His Gly He Arg Leu pro Leu Arg Ser 
20 25 30 

Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr 
65 ~ 70 75 80 
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VaI Gly Ser Pro Pro Gin Thr Lou Asn lie Leu Val Asp Thr Gly Ser 
85 90 95 



Ser Asn Plie Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 lOS 110 



Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 



Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 



Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 150 155 160 



Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asr. Trp 
165 170 175 



Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Leu cys Gly 
ISO 185 190 



Ala Gly Phe Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly 
195 200 205 



Ser Met Ha He Gly Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu 
210 215 220 



Trp Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He He Val 
225 230 235 240 
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Arg Val Glu He Asn Gly Gin Asp Lgu Lys Met Asp Cys Lys Giu Tyr 
245 250 25S 



Asn Tyr Asp Lys Ser He Val Asp Ser Gly Ttir Tftr Asn Leu Arg Leu 
260 265 270 



Pro Lys Lys Val Phe Glu Ala Ala Val Lys Ser He Lys Ala Ala Ser 
275 280 285 



Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val 
290 295 300 



Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser 
305 310 315 320 



Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He 
32S 330 335 



Lgu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Aia Thr Ser Gin 
340 345 350 



Asp Asp Cys Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val 
355 360 365 



Met Gly Ala Val He Met Glu Gly Phe Tyr Val Val Phe Asp Arg Ala 
370 375 380 



Arg Lys Arg He Gly Phe Ala Val Ser Ala Cys Kis Val His Asp Glu 
385 390 395 4O0 



Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu 
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405 410 415 

Asp Cys Gly Tyr Asd lie Pro Gin Thr Asp Glu Ser Thr Leu Met Thr 
420 425 430 

lie Ala Tyr Val Met Ala Ala lie Cys Ala Leu Phe Met Leu Pro Leu 
435 440 445 

Cys Leu Met Val Cys Gin Trp Arg Cys Leu Arg Cys Leu Arg Gin Gin 
450 455 460 

His Asp Asp Phe Ala Asp Asp lie Ser Leu Leu Lys 
465 470 475 



<210> 7 
<211> 2043 
<212> DHA 

<213> Mas musculus 



<400> 7 



atggccccag 


cgctgcactg 


gctcctgcta 


tgggtgggct 


cgggaatgct 


gcctgcccag 


6C 


ggaacccatc 


tcggcatccg 


gctgcccctt 


cgcagcggcc 


tggcagggcc 


acccctgggc 


120 


ctgaggctgc 


cccgggagac 


tgacgaggaa 


tcggaggagc 


ctggccggag 


aggcagctct 


180 


gtggagatgg 


tggacaacct 


gaggggaaag 


tccggccagg 


gctactatgt 


ggagatgacc 


240 


gtaggcagcc 


ccccacagac 


gctcaacatc 


ctggtggaca 


cgggcagtag 


taactttgca 


300 


gtgggggctg 


ccccacaccc 


ttccctgcac 


cgctactacc 


agaggcagct 


gtccagcaca 


360 


tatcgagacc 


tccgazuiggg 


tgcgtatgtg ccctacaccc 


agggcaagtg ggagggggaa 


420 


ctgggcaccg 


acctggtgag 


catccctcat 


ggccccaacg 


tcactgcgcg 


tgccaacatt 


480 


gctgccacca 


ctgaaccgga 


caagctctcc atcaatggct 


ccaactggga gggcatccta 


540 


gggctggcct 


atgctgagat 


tgccaggccc 


gacgactctt 


tggagccctt 


ctttgactcc 


600 
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ctggtgaagc 


agacccacat 


tcccaacatc 


cccctcaacc 


agaccgaggc 


accggcctcg 


gaccactcgc 


tatacacggg 


cagtctccgg 


gaagtgacca 


CCgcacgcgc 


ggaaatcaaL 


tacaacLacg 


acaagagcat 


tgtggacagt 


gtatttgaag 


ctgccgtcaa 


gtccatcaag 


ggcttttggc 


taggggagca 


gctggtgtgc 


ttcccagtca 


tttcactrca 


cctcatgggt 


atccttcctc 


agcaatacct 


acggccggtg 


cacaagttLcg 


ctgtctcaca 


gtcacccacg 


ggtttccacg 


tcgtccccga 


tcgagcccga 


catgcgcacg 


atgagctcag 


gacggcggca 


gaagac bgtg 


gctacaacat 


iccccagaca 


gtcatggcgg 


ccatctgcgc 


ccrcttcatg 


cgctgcctgc 


gttgcctgcg 


ccaccagcac 


aagtaaggag 


gctcgtgggc 


agatgatgga 


ctttggtcac 


atgagt.gga 


gctatggatg 


caccaacctg 


ccaatgctcc 


tggcgtgaca 


gggctcgcac 


ctgtaggaca 


caggagaggg 


ccttaggcac 


cacaaacttg 


agttggaaat 


ctgcccagca 


tccttcagag 


tccccaacct 


tggcgtcata 


ctcaggctac 


ccggcatgtg 


aatctcattc 


cccgctggcc 


aaagtcagca 


cgatagggac 


tgcagactca 


agcctacact 



gaa 

<210> 8 

<211> 501 

<212> PRT 

<213> Mus naisculus 



ttttccctgc agc-ctgLgg cg:;tggcttc 660 
grgggaggga gcatgatcat cggtggtatc 720 
tacacaccca tccggcggga gtggtattat 780 
ggccaagatc tcaagatgga ctgcaaggag £40 
gggaccacca accttcgctc gcccaagaaa 900 
gcagccCcct. cgacggagaa gttcccggat S6C 
cggcaagcag gcacgacccc ttggaacatt 1020 
gaagtcacca atcagtcctt ccgcatcacc 108O 
gaggacgtgg ccacgtccca agacgactgt 1140 
ggcactgtta tgggagccgc catcatggaa 1200 
aagcgaatcg gctccgccgt cagcgcccgc 1260 
gcggaaggtc cgtttgctac ggcagacacg 1320 
gatgagtcaa cacttatgac catagcctat 1380 
tcgccacTct gcctcacggt atgtcagtgg 1440 
gatgactttg ctgatgacat ctccctgctc 1500 
gacgcccctg gaccacatct gggtggttcc 1560 
gtacctgtgg ccagagcacc tcaggaccct 1520 
gaacagagaa atcaggcaag ctggattaca 1680 
aaggaagcag cgttctggtg gcaggaatat 1740 
tttgctgctt gaagcttcag ccctgaccct 1800 
aaagcaztcc ttatgtcctc ccageuigtac 1860 
tccctgtggt accctggcag agaaagggcc 1920 
gaagaaggtg aagtttgcca gttgctttag 1980 
ggcacaaaga ccgcgtcttg agataaacaa 2040 

2043 
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<4oo> e 

Met Ala Pro Ala Lou His Trp Leu Leu Leu Trp Val Gly Ser Gly Met 
15 10 IE 

Leu Pro Ala Gin Gly Thr His Leu Gly lis Arg L«u Pro Leu Arg Ser 
20 25 30 

Gly Leu Ala Gly Pro Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Ser Glu Glu Pro Gly Arg Arg Gly ser Phe Val Glu Met val 
50 55 6C 

Asp Asn Leu Arg Gly Lys Ser Gly Gin Gly Tyr Tyr Vai Glu Met Thr 
65 70 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn He Leu val Asp Thr Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser He Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 150 1S5 160 
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Ala Ala lie Thr Glu Ser Asp Lys Phe Phe lie Asn Gly Ser Asr. Trp 
16b 170 1?S 



Glu Gly lie Leu Gly Leu Ala Tyr Ala Glu lie Ala Arg Pro Asp Asp 
180 185 190 



Ser Lou Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His He Pro 
195 200 205 



Asn He Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
21C 215 220 



Thr Glu Ala Leu Ala Ser Val Gly Gly Ser Met He He Gly Gly He 
225 230 235 240 



Asp His Ser Leu Tyr Thx Gly Ser Leu Trp Tyx Thr Pro He Arg Arg 
245 . 250 255 



Glu Trp Tyr Tyr Glu Val He He Val Arg Val Giu He Asn Gly Gin 
260 265 270 



Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 
275 280 285 



Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 
290 29b 300 



Ala Val Lys Ser He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp 
305 310 315 320 



Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
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Pro Trp Asn lie Phe Pro Val lie Ser Leu lyr Leu Met Gly Glu Val 
340 345 350 

Thr Asn Gin Ser Phe Arg lie Thr lie Leu Pro Gin Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin As? Asp Cys Tyr Lys Phe Ala 
370 375 38C 

Val Ser Gin Ser Ser Thz Gly Thr Val Met Gly Ala Val He Met Glu 
385 390 395 400 

Gly Phe lyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 

Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 

Gly Pro Phe Val Thr Ala Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
435 440 445 

Gin Thr Asp Glu Ser Thr Leu Met Thr He Ala Tyr Val Mat Ala Ala 
450 455 460 

He Cys Ala Leu Phe Met Leu Pro Leu Cys Leu Met Val Cys Gin Trp 
465 470 475 480 

Arg Cys Leu Arg Cys Leu Arg His Gin His Asp Asp Phe Ala Asp Asp 
485 490 495 



19 



wo 00^7369 



PCTAJS99/2088I 



He Ser Lou L«u Lys 
500 

<210> 9 
<211> 2088 
<212> DKA 



<213> Hcmo 


sapiens 




- 

<400> 9 








gttCggcact 


gctcccgccg 


cccactga^cf 


gteuiCgc egg 


cctgccggcc 




aca'tgaa C g t 


ccagaatggg 




a*accaagga 


aggcatcctg 


cagat caeca 


atgtggtaga 


agccaaccaa 


ggccgcaagc 


agtgcaagac 


ccatccccac 


gagtttgtaa 


gtgatgccct 


tctcgttcct 


atggatgttc 


gcgaaacrca 


ccctcactgg 


aagagtacca 


acttgcatga 


ctacggcatg 


ggggtagagt 


ttgtgtgttg 


cccactggct 


gcggaggagg 


atgactcgga 


tgtccggtgg 


agtgaagaca 


aagtagtaga 


agtagcagag 


gaagccgatg 


atgacgagga 


cgacgaggat 


ccctacgaag 


aagccacaga 


gagaaccacc 


gagtctgtgg 


aagaggtggt 


tcqagttcct 


gacaagtatc 


tcgagacacc 


tggggatgag 


gagaggcttg 


aggccaagca 


ccgagagaga 


gcagaiacgtc 


aagcaaagaa 


cttgcctaaa 


caggagaaag 


tggaatcttt 


ggaacaggaa 


acacacatgg 


ccagagtgga 


agccatgctc 



gccgcccgga cggctcgggc gccggaggta 60 
gaaccccaga tcgccatgtt ctgtggcaga 120 
aagtgggatt cagatccatc agggaccaaa 180 
cagtatrgsc aagaagtcta ccctgaactg 240 
ccagtgacca tccagaactg gtgcaagcgg 300 
tttgtgattc cctaccgctg cttagttggt 360 
gacaagtgca aattcttaca ccaggagagg 420 
cacaccgtcg ccaaagagac argcagtgag 480 
ttgctgccct gcggaactga caagtcccga 540 
gaagaaagtg acaatgtgga ttccgctgac 600 
ggcggagcag acacagacta tgcagatggg 660 
gaggaagaag tggctgaggt ggaagaagaa 720 
ggtgargagg tagaggaaga ggctgaggaa 760 
agcattgcca ccaccaccac caccaccaca 840 
acaacagceg ccagtacccc tgatgccgt:t 900 
aatgaacatg cccatttcca gaaagccaaa 960 
atgtcccagg tcacgagaga atgggaagag 102C 
gctgataaga aggcagttat ccagcatttc 1080 
gcagccaacg agagacagca gctggtggag 1140 
aacgaccgcc gccgcctggc cctggagaac 1200 
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tacatcaccg 


ctctgcaggc 


tgttcctcct cggcctcgtc 


asgtgttcaa 


tatgcraciag 


1260 


aagtatgzcc 


gcgcagaaca 


gaaggacaga cagcacaccc 


toaagcattt 


cgagcatgtg 


1320 


cgcatggtgg 


atcccaagaa 


agccgctcag atccggtccc 


aggtracgac 


acacccccgt 


13 80 


gtgatttatg 


agcgcatgaa 


tcagticccLc tccctgctct 


acaacgcgcc 


tgcagcggcc 


1443 


gaggagattc 


aggatgaagt 


tgacgagctg cttcagaaag 


agcaaaacta 


ttcagatgac 


1500 


gtcttggcca 


acatga'ttag 


Cgaaccaagg azcag^tacg 


gaaacgatgc 


tctcatgcca 




cctttgaccg 


aaa cgaaaa c 


caccgCggag ctccttcccg 


tgaatggaga gtccagcc-g 


1620 


gacgatctcc 


agccgtggca 


ttcttttggg gctgactctg 


tgccagccaa 


cacagaaaac 


1680 


gaagttgagc 


ccgttgatgc 


ccgccctgct gccgaccgag 


gactgaccac 


tcgaccaggt 


1740 


tctgggrtga 


caaatatcaa 


gacggaggag atctctgaag 


tgaagatgga 


tgcagaattc 


1800 


cgacatgact 


caggatatga 


agcccaccat eaaaaattgg 


tgtcctttgc 


agaagatgtg 


1860 


ggttcaaaca 


aaggtgcaat 


cattggactc acggtgggcg 


gtgttgtcat 


agcgacagtg 


1920 


aicgtcatca 


cctiggtgat 


gctgaagaag aaacagtaca 


catccattca 


tcatggtgtg 


19&0 


gtggaggtcg 


acgccgctgt 


caccccagag gagcgccacc 


tgtccaagac 


gcagcagaac 


2040 


ggctacgaaa 


atccaaccta 


caagttcttt gagcagatgc 


agaactag 




208B 



<210> 10 
<211> 695 
<212> PRT 

<213> Homo sapiens 
<400> 10 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Mat His Met Asn Val Gin 
35 40 45 
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Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Ttix Lys Thr Cys lie Asp 
5C ' 55 60 



Thr Lys Glu Gly lie Leu Gin Tyr cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 



Gin He Thr Asn Val Vai Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 



Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
IOC 105 110 



He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 12S 



Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val cys 
130 135 140 



Glu Thr His Leu His t*rp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 15C 155 160 



Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 



Asp Lys Phe Arg Gly Val Glu Phe Val cys Cys Pro Leu Ala Glu Glu 
IBO 185 190 



Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 



Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
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Val Val Glu VaX Ala Glu Glu Glu Gla Val Ala Glu Val Glu Glu Glu 
225 230 23£ 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 

Ala Thr Thr Thr Thr Thr Thr Thr Glu ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 32C 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 33C 335 

Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Lou Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 



Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 

370 * 375 380 
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Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyz lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phc 
405 410 415 

Asn Met Leu I^ys Lys lyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met lie Ser Glu Pro Arg He ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu L>eu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 
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Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asr. 
S45 55G SS5 560 

Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu lie Ser 
580 585 590 

Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 

He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 

His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 865 670 



His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyx Lys 
675 680 635 



Phe Phe Glu Gin Met Gin Asn 
690 695 
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<210> 11 

<211> 2088 ' 

<212> DNA 

<213> Homo sapiens 

<400> 11 

atgctgcccg gtttggcact gctcctgctg 
cccactgatg gtaatgczgg cccgctggct 
ctgaacatcc acatgaatgt ccagaatggg 
acctgcattg ataccaagga aggcatcctg 
cagatcacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccatccccac 
gagtttgtaa gtgargccct tctcgccccc 
atggatgttt gcgaaactca tcttcactgg 
aagagtacca actcgcatga ctacggcatg 
ggggtagagt tcgtgtgttg cccactggct 
gcggaggagg atgactcgga cgtctggtgg 
agtgaagaca aagtagtaga agtagcagag 
gaagccgatg atgacgagga cgatgaggat 
ccctacgaag aagccacaga gagaaccacc 
gagtctgtgg aagaggtggt tcgagttcct 
gacaagtatc ccgagacacc tggggatgag 
gagaggcttg aggccaagca ccgagagaga 
gcagaacgtc aagcaaagaa cttgcctaaa 
caggagaaag cggaatcrcc ggaacaggaa 
acacacatgg ccagagtgga agccatgctc 
tacatcaccg ctctgcaggc tgttcctcct 
aagtatgtcc gcgcagaaca gaaggacaga 
cgcatggtgg atcccaagaa agccgctcag 
gtgatttacg agcgcatgaa tcagtctctc 
gaggagattc aggatgaagt tgatgagctg 



gccgcccgga 


cggctcgggc 


gctggaggta 


60 


gaaccccaga 


ctgccatgtt 


ctgtggcaga 


120 


aagtgggatt 


cagatccatc 


agggaccaaa 


180 


cagtattgcc 


aagaagtcta 


ccctgaactg 


240 


ccagcgacca 


tccagaactg 


gtgcaagcgg 


300 


tccgtgattc 


cctaccgctg 


cttagLCggt 


3 60 


gacaagtgca 


aatccttaca 


ccaggagagg 


<20 


cacaccgrcg 


ccaaagagac 


atgcagtgag 


480 


ttgctgcccc 


gcggaattga 


caagctccga 


540 


gaagaaagcg 


acaatgtgga 


tcctgctgat 


600 


ggcggagcag 


acacagacta 


tgcagatggg 




gaggaagaag 


tggctgaggt 


ggaagaagaui 


720 


ggtgatgagg 


tagaggaaga 


ggccgaggaa 


780 


agcattgcca 


ccaccaccac 


caccaccaca 


840 


acaacagcag 


ccagcacccc 


tgatgccgtt 


900 


aatgaacatg 


cccattccca 


gaaagccaaa 


960 


atgtcccagg 


tcatgagaga 


atgggaagag 


1020 


gctgaCaaga 


aggcagttat 


ccagcatttc 


1080 


gcagccoacg 


agagacagca gctggcggag 


1140 


aatgaccgcc 


gccccccggc 


cctggagaac 


1200 


cggcctcgtc 


acgcgttcaa 


tacgctaaag 


1260 


cagcacaccc 


taaagcattt 


cgogcatgtg 


1320 


atccggtccc 


aggttatgac 


acacctccgt 


1380 


tccctgctct 


acaacgtgcc 


tgcagtggcc 


144C 


cttcagaaag 


agcfLOaacta 


ttcagatgac 


1500 
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acacgactag 


tgaaccaagg 


aL=agccacg 


gaaacgargc 


cc-cacgcca 


15S0 


tcCttga=cg 


^aacgaaaac 


caccgtggag 


ctccttcccg 


tgaatggaga gttcagcctg 




gacgaccccc 


agccgtggca 


ctccttcggg 


gctgactctg 


tgccagccaa 


cacagaaaac 


1680 


gaagCigagc 


ctgtrgatgc 


ccgccccgct 


gccgaccgag 


gactgaccac 


tcgaccaggt 


1740 


tctgggctga 


caaatatcaa gacggaggag atctctgaac 


tgaacctgga 


tgcagaattc 


1800 


cgacacgact 


caggatatga 


agctcaccat 


caaaaaCtgg 


tgtLcttCgc 


agaagacgtg 


1660 


ggttcaaaca 


aaggcgcaat 


cattggactc 


atgqtgggcg 


gtgttgccat 


agcgacagtg 


1920 


atcgtcatca 


=cttggtgat 


gctgaagaag aaacagtaca 


catccactca 


ccatggcgtg 


1980 


gtggaggttg 


acgccgctgt 


caccccagag 


gagcgccacc 


tgtccaagat 


gcagcagaac 


2040 


ggctacgaaa 


atccaaccta caagttcttt 


gagcagatgc 


agaactag 




2088 



<210> 12 
<211> 695 
<212> PRT 

<213> Homo sapiens 
<400> 12 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Tbr Ala Arg 
15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 



Gin lie Ala Met Phe Cys Gly Arg 
35 43 

Asn Gly Lys Trp Asp Ser Asp Pro 
50 55 

Thr Lys Glu Gly lie Leu Gin Tyr 
65 70 



Leu Asn Met His Met Asn Val Gin 
45 



Ser Gly Thr Lys Thr Cys He Asp 
60 



Cys Gin Glu Val Tyr Pro Glu Leu 
75 80 
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Gin He Thr -Asn Val Val Giu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 



Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 



He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 



Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp val Cys 
130 135 140 



Giu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr cys Ser Glu 
145 150 155 160 



Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 



Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pre Lou Ala Glu Glu 
180 IBS 190 



Ser Asp As3 Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 2O0 205 



Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 



Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 
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Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 

Glu Ala Glu Glu Pre Tyr Glu Glu Ala Thr Glu Arg "^hr Thr Scr lie 
260 265 270 

Ma Thr The Thr Thr Thr Thr ThT Glu Ser Val Glu Glu Val Val Arg 
275 280 285 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 

Giu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 

Glu Trp Giu Giu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Giu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 38C 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 



Tyr lie Thr Ala Leu Gin Aia Val Pro Pro Arg Pro Arg His Val Phe 
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Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 43C 

Thr heu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 

Ala Gin lie Arg Ser Gin Val Met Thr His Lbu Arg Val lie Tyr Glu 
450 455 460 

Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 

Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
4B5 490 495 

Tyr Ser Asp Asp Val Leu Ala Asn Met ;le Ser Glu Pro Arg He Ser 
500 505 510 

Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 



Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
5€g 570 575 
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Thr Arg Pro 'Gly Ser Gly Leu Thr Asn lie Lys Thr Giu Glu He Ser 
580 585 590 



Glu Val Asn Leu Asp Ala Slu Phe Arg His Asp Ser Gly Tyr Glu val 
595 600 605 



His His Gin Lys Leu Val Phe Pbe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 



Gly Ala He He Gly Leu Bet Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 



He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
645 650 655 



His His Gly Val Val Glu val Asp Ala Ala val Thr Pro Glu Glu Arg 
660 665 670 



His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 



Phe Phe Glu Gin Met Gin Asn 
690 695 



<210> 13 
<211> 2088 
<212> DNA 

<213> Homo sapiens 
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<400> 13 

atgctgcccg gtttggcact gcLcctgctg 
cccaccgatg gtaatgctgg cctgctggct 
ctgnacatgc acatgaatgt ccagaatggg 
acctgcattg ataccaagga aggcarc=tg 
cagatcacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccacccccac 
gagcztgtaa gtgatgccct tctcgttcci 
atggatgtct gcgaaactca tcttcactgg 
aagagtacca actcgcacga ctacgg=atg 
ggggtagagt ttgtgtgttg cccactggcr 
g=ggaggagg atgactcgga tgtctggtgg 
agtgaagaca aagtagtaga agtagcagag 
gaagccgatg atgacgagga cgatgaggat 
ccctacgaag aagccacaga gagaaccacc 
gagtccgtgg aagaggtggt tcgagtccct 
gacaagtatc tcgagacacc tggggatgag 
gagaggcrtg aggccaagca ccgagagaga 
gcagaacgtc eiagcaaagaa cttgcctaaa 
caggagaaag tggaatcttt ggaacaggaa 
acacacatgg ccagagtgga agccatgctc 
tacatcaccg ctctgcaggc tgttcctcct 
aagtatgtcc gcgcagaaca gaaggacaga 
cgcatggtgg atcccaagaa agccgctcag 
gtgatctatg agcgcatgaa tcagtctctc 
gaggagattc aggatgauigC cgatgagctg 
gtcttggcca acatgattag tgaaccaagg 
tctttgaccg aaacgaaaac caccgtggag 
gacgatctcc agccgtggca ttcttttggg 
gaagttgagc ctgttgatgc ccgccctgct 
tctgggttga caaatatcaa gacggaggag 



gccgcctgga cggcCcgggc 


gccggaggta 


60 


gaaccccaga ttgccatgtt 


c-gtggcaga 


120 


aagtgggatt cagatccacc 


agggaccaaa 


ISO 


cagtattgcc emgaagtcta 


ccccgaa=tg 


240 


ccagtgacca tccagaactg 


gtgcaagcgg 


300 


tttgtgactc cctaccgctg 


cctagtcggt 


360 


gacaagcgca aaCCcttaca 


ccaggagagg 


420 


cacaccgtcg ccaaagagac 


atgcagtgag 


480 


ttgccgcccL gcggaattga 


caagctccga 


540 


gaagaaagtg acaatgtgga 


ttctgctgat 


600 


ggcggagcag acacagacta 


tgcagatggg 


660 


gaggaagaag tggctgaggt ggaagaagaa 


720 


gg;:gatgagg tagaggaaga 


ggccgaggaa 


780 


agcactgcca ccaccaccac 


caccaccaca 


840 


acaacagcag ccagtacccc 


tgargccgct 


9O0 


aatgaacatg cccatttcca 


gaaagccaaa 


960 


atgtcccagg tcatgagaga 


atgggaagag 


1020 


gctgataaga aggcagttat 


ccagcatttc 


1080 


gcagccaacg agagacagca gctggtggag 


1140 


aatgaccgcc gccgcctggc 


cccggagaac 


1200 


cggcctcgtc acgtgttcaa 


tatgctaaag 


1260 


cagcacaccc taaagcattt 


cgagcatgtg 


1320 


acccggcccc aggttaCgac 


acacctccgt 


1380 


cccctDctct acaacgtgcc 


tgcagtggcc 


1440 


ctccagaaag agcaaaacca 


ttcagatgac 


1500 


atcagttacg gaaacgatgc 


tctcatgcca 


1560 


ctccttcccc tgaatggaga gttcagcctg 


1620 


gctgactctg tgccagccaa 


cacagaaaac 


1680 


gccgaccgag gaccgaccac 


tcgaccaggt 


1740 


atctctgaag tgaagatgga 


tgcagaattc 


1800 
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cgacatgact caggatai:ga agttcatcat caaaaattgg tgctcttcgc age^igatgtg 1860 

ggttcaaaca aaggtgcaat cattggaccc atggcgggcg gtgtcgtcac agcgacagtg 1920 

atcttcatca ccttggtgat gctgaagaag aaacagtaca catccattca ccatggcgcg 1980 

gcggaggttg acgccgccgt cascccagag gagcgccacc tgtccaagat gcagcagaac 2040 

ggctacgaaa atccaaccta caagttcttt gagcagatgc agaactag 2088 

<210> 14 

<211> 695 

<212> PRT 

<213> Homo sapiens 

<40C> 14 

Met Leu Pro Gly Leu Ala Leu Leu l^au Leu Ala Ala Trp Thr Ala Arg 

15 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 4C 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 
50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr lie Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
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lie Pro Tyr Arg Cys Leu Val Gly Clu Phe Val Sar Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val Cys 
130 135 140 

Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr cys Ser Glu 
145 150 155 160 

Lys ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly "Xe 
165 170 175 

Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 

Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp ser Asp Val 
195 200 205 

Trp Tip Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 

Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 

Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 



Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 ~ 265 270 
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Ala Thr ThT Thr Thr Thr Thr Tlir Glu Ser Val Clu GJm Val Val Arg 
275 280 265 

Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lya Tyr Leu 
290 295 300 

Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Hei Ser Gin Val Met Arg 
325 330 335 

Clu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 38C 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
365 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 
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THr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 ' 440 445 



Ala Gin lie Arg Ser Gin Val Mec Thr His Leu Arg Val He Tyr Glu 
450 45S 460 



Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr hsn Val Pro Ala Val Ala 
465 470 475 480 



Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
4B5 490 495 



Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 



Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu THr Lys Thr Thr 
515 520 525 



Val Glu Leu Leu Pre Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 



Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 



Glu Val Glu Pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 



Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 



Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
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His His Gin Lys Leu Val Phe Phe Aia Glu Asp Val Gly Ser Asn Lys 
£10 615 620 



Gly Ala lie He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 63C 635 640 



He Phe He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr Ser He 
6iS 650 6S5 



His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
650 665 670 



His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
67S 68C 685 



Phe Phe Glu Gin Met Gin Asn 
690 695 



<210> 15 

<211> 2094 

<212> DKA 

<213> Homo sapiens 

<400> 15 

atffctgcccg gtctggcact gctcctgctg 

cccactgatg gtaatgctgg cctgctggct 

ctgeiacatgc acatgaatgt ccagaatggg 

acctgcattg ataccaagga aggcatcctg 



gccgcctgga sggctcgggc gctggaggta 60 
gaaccccaga ttgccatgtt: ctgcggcaga 120 
aagtgggatt cagatccatc agggaccaaa 180 
cagtactgcc Mgaagtcta ccctgaactg 240 
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cagarcacca 


acgiggtaga 


agccaaccaa 


ccagi^gacca 


cccagaaccg 


grgcaagcgg 




ggccgcaagc 


agtgceiagBC 


ccatccccac 


tctgtgattc 


ccCaccgctg 


c btagtcggt 




gagtccgcaa 


gcgacgccct 


CCCcg~Ccct 


gacaagtgca 


aaLCCt^ti^ca 


ccaggagagg 




atggatgttr 


gcgaaactca 


t c t Ccac tgg 


cacaccgccg 


ccaaagagac 


aCgcagcgag 




aagagtacca 


acttgcatga 


ccacggcatg 


ttgctgccct 


gc ggaaccga 


ca«tgc^ccga 




ggggtagagt 


ttgtgtgtcg 


cccacCggct 


gaagaaagtg 


acaa cg -gga 


ttctgctgaiL 




gcggaggagg 


atgactcgga 


tgtctggtgg 


ggcggagcag 


acacagacta 


tgcagatggg 




agtgaagaca 


aagtagtaga 


agtagcagag 


gaggaagaag 


tggctgaggt 


ggaagaagaa 




gaagccgazg 


atgacgagga 


cga tg agga t: 


ggtgatgagg 


tagagg aaga 


ggc tgaggaa 




ccctacgaag 


aagccacaga 


gagaaccacc 


agcattgcca 


ccaccaccac 


caccaccaca 




gag-ctgtgg 


aagaggcgg^ 


tcgagCtcct 


acaacagcag 


ccagtacccc 


tga&gccgtt 




gacaagtatc 


tcgagacacc 


tggggatgag 


aaiigaacacg 


cccat wCcca 


gaaagccaaa 


960 


gagaggcctg 


agg c caAgca 


ccgagagaga 


acgccccagg 


Cca^gagaga 


acgggaagag 




gcagaacgtc 


aagcaaagaa 


c ttgcstaaa 


gctgataaga 


aggcagttat 


ccagcaLttc 




caggagaaag 


tggaatcttt 


ggaacaggaa 


gcagccaacg 


agaga cage a 


gctggtggag 




acacacatgg 


ccagagtgga 


agccat.gctc 


aatgaccgcc 


gccgcc tggc 


c c tggagaa c 




Cacatcaccg 


ctctgcaggc 


tgttcctcct 


cggcctcgtc 


acgtgCtcaa 


tatgctaaag 




aagtatgtcc 


gcgcagaaca 


gaaggacaga 


cagcacaccc 


taaagcattt 


cgagcatgtg 




cgcatggtgg 


atcccaagaa 


agccgctcag 


atccggtccc 


aggttatgac 


acacctccgt 


1380 


gCgattcacg 


agcgcaCgaa 


tcagtctctc 


tccctgctct 


acaacgtgcc 


tgcagtggcc 




gaggagattc 


aggatgaagt 


tgatgagctg 


cttcagaaag 


agcaaaacta 


t tcagatgac 




gccttggcca 


acacgac tag 


t gaa c caag g 


atcagttacg 


gaaacgatgc 


tcccatgcca 




tcCttgaccg 


aaacgaaaac 


caccgtggag 


ctccttcccg 


tgaatggaga 


gttcagccCg 




gacgatctcc 


agccgtggca 


ttctttcggg 


gctgactctg 


tgccagccaa 


cacagaaaac 


1680 


gaagtcgagc 


ctgttgatgc 


ccgccctgct 


gccgaccgag 


gaccgaccac 


tcgaccaggt 


1740 


tctgggttga 


caaatatcaa 


gacggaggag 


atctctgeiag 


tgaagatgga 


tgcagaattc 


leoo 


cgacatgact 


caggatatga 


agctcatcat 


caaaaattgg 


tgttctttgc 


agaagatgtg 


1860 


ggttcaaaca 


aaggtgcaat 


cattggactc 


atggtgggcg 


gtgttgtcat 


agcgacagtg 


1920 


atcgtcatca 


ccttggtgat 


gccgaagaag 


aaacagtaca 


caticcatcca 


tcatggtgtg 


1980 


gtggaggttg 


acgccgctgt 


caccccagag 


gagcgccacc 


tgccceiagat 


gcagcagaac 


2040 


ggctacgaaa 


atccaaccta 


caagttcrtt 


gagcagatgc 


agaacaagaa 


gtag 


2094 



38 



PCT/US99/20881 



<210> 16 

<211> 697 

<212> PRT 

<213> Hoino sapiens 

<400> 16 

Met Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
1 5 10 15 

Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Sly L«u Leu Ala Glu Pro 

20 25 30 

Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys lie Asp 

50 55 60 

Thr Lys Glu Gly lie Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin lie Thr Asn Val Val Glu Ala Asn Gin Pro val Thr lie Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro His Phe Val 
100 105 110 

lie Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 
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Val Pro Asp Lys Cys Lys Phe Leu His Gin Glu Arg Met Asp Val cys 
130 ' 135 140 



Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 150 155 160 



Lys Ser Ttix Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly lie 
165 170 175 



Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 



Ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp ser Asp Val 
195 200 205 



Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly ser Glu Asp Lys 
210 21S 220 



Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 



Glu Ala Asp Asp Asp Glu Asp Asp Glu. Asp Gly Asp Glu Val Glu Glu 
245 250 255 



Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 



Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 



Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyx Leu 
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Glu Thr Pro Gly As? Glu Asn Glu His Ala His Phe Gin Lys Ala r>ys 
305 310 315 320 

Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met ser Gin Val Met Arg 
325 330 335 

Glu Trp Glu Glu Ala Glu Arg Glr. Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 

Lys Lys Aia Val lie GXn His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 

Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 

Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 

Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 

Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 

Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 



Ala Gin lie Arg Ser Gin Val Met Thr His Leu Arg Val lie Tyr Glu 
450 ~ 455 460 
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Arg Met Asn tiln Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 



Glu Glu lie GXb Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 



Tyr Ser Asp Asp Val Leu Ala Asn Met lie Ser Glu Pro Arg lie Ser 
500 SOS SIO 



Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr 
515 520 525 



Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 



Pro Trp His Ser Phe Gly Ala Asp Ser Vai Pro Ala Asn Thr Glu Asn 
545 550 555 560 



Glu Val Glu Pro Val Asp Aia Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 



Tlir Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Giu He Ser 
580 585 590 



Glu Val Lys Ket Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 



His His Gin Lys Leu Val Pho Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 
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Gly Ala He He Gly Leu Met Val Gly Gly Val Val Ala Ttir Val 
625 ' 630 635 640 



He Val He Thr Leu Val Met h&\i Lys Lys Lys Gin Tyr Thr Ser He 
645 650 6SS 



His His Gly Val Val Glu Val Asp Ala Ala Val Tnr Pro Glu Glu Arg 
650 665 670 



His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 



Phe Phe Glu Gin Met Gin Asn Lys Lys 
690 695 



<210> 17 

<211> 2094 

<212> DNA 

<213> HoEio sapiens 

<400> 17 

atgctgcscg gtttggcact gctcctgctg 
cccacLgatg gtaaigctgg cctgctggct 
ctgaacatgc acatgaatgt ccagaacggg 
acctgcattg acaccaagga aggcatcctg 
cagaccacca atgtggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccatccccac 
gagtttgtaa gtgatgccct tctcgttcct 
atggatgtt- gcgaaactca tcttcactgg 
aagagtacca acttgcatga ccacggcatg 



gccgcctgga 


cggctcgggc 


gctggaggta 


6D 


gaaccccaga 


ttgccatgtt 


ccgtggcaga 


120 


aagtgggatt 


cagatccatc 


agggaccaaa 


ISO 


cagtattgcc 


aagaagtcta 


ccctgaactg 


240 


ccagtgacca 


tccagaactg 


gtgcaagcgg 


300 


tttgtgattc 


cctaccgctg 


cttagttggt 


3 60 


gacaagtgca 


aattcctaca 


ccaggagagg 


420 


cacaccgccg 


ccaaagagac 


atgcagcgag 


480 


ttgctgccct 


gcggaa-tga 


caagrtccga 


540 
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ffffggnagagt 


ttgtgtgttg 


cccaccggci 


gcggaggagg 


atgactcgga 


tguctggtgg 


agtgaagaca 


aagcagtaga 


agcagcagag 


gaagccgatg 


atgacgagga 


cgatgaggat 


ccctacgaag 


aagccacaga 


gagaaccacc 


gagtctgtgg 


aagaggtggt 


tcgag-tcct 


gacaagt.arc: 


tcgagacacc 


tggggatgag 


gagaggctrg 


aggccaagca 


ccgagagaga 


gcagaacg-c 


aagcaaagaa 


cttgcctaaa 


caggagaaag 


tggaatcLtt 


ggaacaggaa 


acacacatgg 


ccagagtgga 


agccatgctc 


tacatcaccg 


ccccgcaggc 


rgtccctcct 


aag-atgtcc 


gcgcagaaca 


gaaggacaga 


cgcatggrgg 


atcccaagaa 


agccgctcag 


gtgatttatg 


agcgcatgaa 


ccagtctctc 


gaggagattc 


aggatgaagt 


t gat gage tg 


gtcttggcca 


acatgattag 


tgaaccaagg 


tctttgaccg 


aaacgaaaac 


caccgtggag 


gacgatctcc 


agccgtggca 


ttcttttggg 


gaagccgagc 


ctgttgacgc 


ccgccctgct 


tctgggttga 


caaatatcaa 


gacggaggag 


cgacatgact 


caggatatga 


agttcatcat 


ggttcaaaca 


aaggtgcaat: 


cattggactc 


accgtcatca 


ccttggtgac 


gctgaagaag 


gtggaggctg 


acgccgctgt 


caccccagag 


ggctacgaaa 


atccaaccta 


caagttcttt 



<210> 18 
<211> 691 
<212> PRT 

<213> Homo sapiens' 



gaagaaagtg 


acoatgcgga 


ttccgc'gat 




ggcggagcag 


acacagacta 


tgcagatggg 




gaggaagaag 


tggctgaggc 


ggaagaagaa 


720 


ggtgatgagg 


tagaggaaga 


ggctgaggaa 




agcattgcca 


ccaccaccac 


caccaccaca 




acaacagcag 


ccagtacccc 


tgatgccgtt 


900 


aatgaacacg 


cccaCttcca 


gaaagccaaa 




aCgtcccagg 


tcatgagaga 


atgggaagag 


1020 


gctgacaaga 


aggcag L Ca t 


ccagcatttc 




gcagccaacg 


agagacagca 


gctggtggag 


1140 


aatgaccgcc 


gccgcczggc 


cctggagaac 


1200 


cggccCcgCc 


acgtgtccaa 


tatgctaaag 




cage acaccc 


"taaagcaCtit 


cgagcatgtg 




atccggtccc 


aggtcaCgac 


acacctccgc 




tccctgctct 


acaacgtgcc 


tgcagtggcc 


1440 


ctrcagaaag 


agcaaaacta 


ttcagatgac 


1500 


atcagttacg 


gaaacgatgc 


tc t caCgcca 




ctccttcccg 


tgaa tggaga 


gtCcagcctg 




gcCgaccctg 


tgccagccaa 


cacagaciaac 


16S0 


gccgaccgag 


gaccgaccac 


tcgaccaggc 


1740 


atctctgaag 


tgaatctgga 


tgcagaattc 


1800 


caaaaattgg 


tgttctttgc 


agaagatgtg 


1860 


atggtgggcg 


gtgttgtcat 


agcgacagtg 


1920 


aaacagtaca 


catccaCtca 


tcacggtgcg 


1980 


gagcgccacc 


tgcccaagat 


gcagcagaac 


2040 


gagcagatgc 


agaacaagaa 


gtag 


2094 
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<400> 18 

Met I<eu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 

Ala Leu GIu Val "Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pro 
20 2S 30 

Gin lie Ala Met ?he cys Gly Arg Leu Asn Met His Ket Asn Val Gin 
35 40 45 

Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Tlir Lys Thr Cys He Asp 
50 55 60 

Thr Lys Glu Gly He Leu Gin Tyr Cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 

Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 

Trp Cys Lys Arg Gly Arg Lys Gin Cys Lys Thr His Pro Kis Ph© Val 
100 105 110 

He Pro Tyr Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 

Val Pro Asp Lys cys Lys Phe Leu His Gin Glu Arg Mat Asp Val cys 
130 135 140 



Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu 
145 ■ 150 155 160 
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Lys Ser Thr -Asn Lcu His Asp lyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 



Asp Lys Phe Arg Gly Val Glu Phc Val Cys Cys Pro Leu Ala Glu Glu 
180 185 190 



Ser Asp Asn Val Asp ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 



Trp Trp Gly Gly Ala Asp Tlir Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 



Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 



Glu Ala Asp Asp Asp Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 



Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser He 
260 26S 270 



Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 235 



Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 



Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 
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Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 



Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 



Lys Lys Ala Val lie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 

355 360 365 



Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 



Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 



Tyr lie Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Phe 
405 410 415 



Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 



Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 



Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 



Arg Met Asn Gin ser Leu ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 



Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
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Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 

lyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Tnr 
515 520 525 

Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu GlD 
530 535 540 

Pro Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 

Glu Val Glu pro Val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Hir 
565 570 575 

Thr Arg Pro Gly Ser Gly Leu Thr Asn He Lys Thr Glu Glu He Ser 
580 585 590 

Glu Val Asn Leu Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Val 
595 600 605 

His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 
610 615 620 

Gly Ala He He Gly Leu Met Val Gly Gly Val Val He Ala Thr Val 
625 630 635 640 



He Val He Thr Leu Val Met Leu Lys Lys Lys Gin Tyr Thr ser He 
64^ 650 655 
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His His Gly'Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 6*70 



His Leu Ser Lys Kec Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
675 680 685 



Phe Phe Glu Gin Met Gin Asn Lys Lys 
690 695 



<210> 19 

<21I> 2094 

<212> DMA 

<213> Homo sapiens 

<40D> 19 

atffctgcccg gtttggcact gctcctgctg 
cccactgatg gtaatgctgg cctgctggct 
ctgaacatgc acaCgaatgt ccagaatggg 
acctgcattg acaccaagga aggcaCcctg 
cagaccacca atgrggtaga agccaaccaa 
ggccgcaagc agtgcaagac ccatccccac 
gagtttgtaa gtgatgccct tctcgttcct 
atggatgttt gcgaaactca tcttcactgg 
aagagtacca acttgcatga ctacggcatg 
ggggtagagt ttgtgtgttg cccactggcc 
gcggaggagg atgactcgga tgtctggrgg 
agcgaagaca aagtagtaga agtagcagag 
gaagccgacg atgacgagga cgatgaggat 
ccctacgaag aagccacaga gagaaccacc 



gccgcctgga 


cggctcgggc 


gctggaggca 


6C 


gaaccccaga 


ttgccatgtt 


ctgtggcaga 


120 


aagtgggatt 


caga-ccatc 


agggaccaaa 


lao 


cagtattgcc 


aagaagtcta 


ccctgaactg 


240 


ccagtgacca 


tccagaiaccg 


gtgcaagcgg 


300 


tttgtgattc 


cctaccgctg 


cctagttggt 


360 


gacaagtgca 


aactcttaca 


ccaggagagg 


420 


cacaccgtcg 


ccanagagac 


atgcagtgag 


480 


ttgctgccct 


gcggeiat tga 


caagttccga 


540 


gaagaaagtg 


acaatgtgga 


ttctgctgat 


600 


ggcggagcag 


acacagacta 


tgcagatggg 


660 


gaggaagiiag 


tggctgaggt 


ggaagaagaa 


720 


ggtgacgagg 


tagaggaaga 


ggctgaggaa 


780 


agcattgcca 


ccaccaccac 


caccaccaca 


840 
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gagcctgtgg 


aagaggtggt 


tcgagtccct 


acaacagcag 


ccagtaccc= 


tgatgccgtt 


9 DC 


gacaagtacc 


ttgagacacc 


^gggg^^g^g 


aatgaacarg 


cccatttcca 


gaaagccaaa 


960 


gagaggcttg 


aggccaagca 


ccgagagaga 


atgtcccagg 


tcatgagaca 


atgggaagag 


1C20 


gcagaacgcc 


aagcaaagaa 


ctcgcccaaa 


gccgataaga 


aggcagttat 


ccagcacttc 


1080 


caggagaaag 


rggaatcttt 


ggaacaggaa 


gcagccaacg 


agagacagca 


gctggtggag 


1140 


acacacatgg 


ccagagtgga 


agccatgctc 


aacgaccgcc 


gccgcctggc 


cctggagaac 


12O0 


tacntcaccg 


ccctgcaggc 


tgttcctcct 


cggcctcgte 


acgcgctcaa 


tatgctaaag 


1260 


aagtaLgtcc 


gcgcagaaca 


gaaggacaga 


cagcacaccc 


Laaagcattt 


cgagcacgcg 


1320 


cgcatggtgg 


atcccaagaa 


agccgcCcag 


atccggtccc 


aggttatgac 


acacctccgt 


1380 


gtgatttatg 


agcgcacgaa 


tcagtctctc 


tccctgctct 


acaacgtgcc 


tgcagtggcc 


1440 


gaggagatLc 


aggatgaagt 


tgatgagctg 


cttcagaaag 


agcaaaacta 


tCcagatgac 


1500 


gtcttggcca 


acacgattag 


tgaaccaagg 


atcagtracg 


gaaacgatgc 


tctcatgcca 


1560 


tctttgaccg 


aaacgaaaac 


caccgtggag 


ctccttcccg 


cgaatggaga 


gttcagcctg 


1620 


gacgatctcc 


agcegtggca 


tccttttggg 


gctgactctg 


tgccagccaa 


cacagaaaac 


1580 


gaagttgagc 


ctgttgatgc 


ccgccctgct 


gccgaccgag 


gactgaccac 


tcgaccaggt 


1740 


tctgggtcga 


caaacaLcaa 


gacggaggag 


atctccgaag 


tgaagacgga 


tgcagaatcc 


1800 


cgacatgact 


cagga^atga 


agttcaccat 


caaaaatcgg 


tgttctttgc 


ageuigatgtg 


1860 


ggttcaaaca 


aaggtgoaa- 


cattggactc 


a-ggcgggcg 


gtgtcgtcac 


agcgacagtg 


1920 


atcttcatca 


ccttggtgat 


gctgaagaag 


siaacagtaca 


catccattca 


tcatggtgtg 


1980 


gtggaggttg 


acgccgctgt 


caccccagag 


gagcgccacc 


tgtccaagat 


gcagcagaac 


2040 


ggctacgaaa 


atccaaccta 


caagttcttt 


gagcagatgc 


agaacaagaa 


gtag 


2094 



<210> 20 
<211> 697 

<212> pax 

<213> Homo sapiens 
<400> 20 

Mst Leu Pro Gly Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Ala Arg 
15 10 15 
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Ala Leu Glu VaX Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu Pre 
■20 2S 30 



Gin lie Ala Met Phe Cys Gly Arg Leu Asn Met His Met Asn Val Gin 
35 40 4B 



Asn Gly Lys Trp Asp Ser Asp Pro Ser Gly Thr Lys Thr Cys He Asp 
50 55 60 



Thr Lys Glu Gly He Leu Gin Tyr cys Gin Glu Val Tyr Pro Glu Leu 
65 70 75 80 



Gin He Thr Asn Val Val Glu Ala Asn Gin Pro Val Thr He Gin Asn 
85 90 95 



Trp Cys Lys Arg Gly Arg Lys Gin cys Lys Thr His Pro His Phe Val 
100 105 110 



He Pro T>-r Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 
115 120 125 



Val Pro Asp Lys cys Lys Phe Leu His Gin Glu Arg Met Asp val cys 
130 135 140 



Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ssr Glu 
145 150 155 160 



Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro Cys Gly He 
165 170 175 



Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys Pro Lsu Ala Glu Glu 
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ser Asp Asn Val Asp Ser Ala Asp Ala Glu Glu Asp Asp Ser Asp Val 
195 200 205 



Trp Trp Gly Gly Ala Asp Thr Asp Tyr Ala Asp Gly Ser Glu Asp Lys 
210 215 220 



Val Val Glu Val Ala Glu Glu Glu Glu Val Ala Glu Val Glu Glu Glu 
225 230 235 240 



Glu Ala Asp Asp Ai:p Glu Asp Asp Glu Asp Gly Asp Glu Val Glu Glu 
245 250 255 



Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser lie 
260 265 270 



Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val Arg 
275 280 285 



Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp Lys Tyr Leu 
290 295 300 



Glu Thr Pro Gly Asp Glu Asn Glu His Ala His Phe Gin Lys Ala Lys 
305 310 315 320 



Glu Arg Leu Glu Ala Lys His Arg Glu Arg Met Ser Gin Val Met Arg 
325 330 335 



Glu Trp Glu Glu Ala Glu Arg Gin Ala Lys Asn Leu Pro Lys Ala Asp 
340 345 350 
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Lys Lys Ala Val :ie Gin His Phe Gin Glu Lys Val Glu Ser Leu Glu 
355 360 365 



Gin Glu Ala Ala Asn Glu Arg Gin Gin Leu Val Glu Thr His Met Ala 
370 375 380 



Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Ala Leu Glu Asn 
385 390 395 400 



Tyr He Thr Ala Leu Gin Ala Val Pro Pro Arg Pro Arg His Val Pbe 
405 410 415 



Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gin Lys Asp Arg Gin His 
420 425 430 



Thr Leu Lys His Phe Glu His Val Arg Met Val Asp Pro Lys Lys Ala 
435 440 445 



Ala Gin He Arg Ser Gin Val Met Thr His Leu Arg Val He Tyr Glu 
450 455 460 



Arg Met Asn Gin Ser Leu Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 
465 470 475 480 



Glu Glu He Gin Asp Glu Val Asp Glu Leu Leu Gin Lys Glu Gin Asn 
485 490 495 



Tyr Ser Asp Asp Val Leu Ala Asn Met He Ser Glu Pro Arg He Ser 
500 505 510 
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•ryr Gly Asn Asp Ala Leu Met Pro ser Leu Thr Glu Thr Lys Thr Thr 
515 • 520 S25 



Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp Leu Gin 
530 535 540 



Pre Trp His Ser Phe Gly Ala Asp Ser Val Pro Ala Asn Thr Glu Asn 
545 550 555 560 



Glu Val Glu Pro val Asp Ala Arg Pro Ala Ala Asp Arg Gly Leu Thr 
565 570 575 



Thr Arg Pro Gly Ser Gly Leu Thr Asn lie Lys Thr Glu Glu He Ser 
580 585 590 



Glu Val Lys Met Asp Ala Glu Phe Arg His Asp Ser Gly Tyr Glu Vol 
595 600 605 



His His Gin Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asij Lys 
610 615 620 



Gly Ala lie He Gly Leu Met val Gly Gly val Val lie Ala Tiir Val 
625 530 63 5 640 



He Phe He Thr Leu Val Met Leu Lys I.ys Lys Gin Tyr Thr Ser He 
645 650 655 



His His GXy Val Val Glu Val Asp Ala Ala Val Thr Pro Glu Glu Arg 
660 665 670 



His Leu Ser Lys Met Gin Gin Asn Gly Tyr Glu Asn Pro Thr Tyr Lys 
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Phe Phe Glu Gin Met Gin Asn Lys Lys 
690 695 



<210> 21 

<211> 1341 

<212> DNA 

<213> Homo sapiens 

<400> 21 

acggctagca tgaccggcgg acagcaaacg 
ctgcccctgc gcagcggcct ggggggcgcc 
gacgaagagc ccgaggagcc cggccggagg 
aggggcaagC cggggcaggg ctactacgCg 
cccaacatcc tggtggatac aggcagcagt 
ttcctgcatc gccactacca gaggcagctg 
gtgtatgCgc cctacaccca gggcaagcgg 
atcccccatg gccccaacgt caccgtgcgt 
aagCtcttca tcaacggctc caactgggaa 
gccaggcctg acgactccct ggagcctctc 
cccaacctct tctccctgca cctttgtggt 
ctggcccctg tcggagggag catgatcatt 
agtctctggt anacaccca- ccggcgggag 
gagatcaatg gacaggatcc gaaaatggac 
gtggacagtg gcaccaccaa ccttcgtccg 
tccatcaagg cagcctcctc cacggagaag 
ctggtgtgct ggcaagcagg caccacccct 
ctaatgggtg aggttaccaa ccagtccttc 
cggccagcgg aagacgtggc cacgtcccaa 



ggccgcggat 


ccacccagca 


cggcacccgg 


60 


cccctggggc 


tgcggctgcc 


ccgggagacc 


120 


ggcagctttg 


tggagatggt 


ggacaacctg 


180 


gagatgaccg 


tgggcagccc 


cccgcagacg 


240 


aaccttgcag 


tgggtgctgc 


cccccacccc 


300 


tccagcacat 


accgggacct 


ccggaagggt 


360 


gaaggggagc 


tgggcaccga 


cctggtaagc 


420 


gccaacattg 


ctgccatcac 


tgaatcagac 


4S0 


ggcatcctgg 


ggctggccta 


tgctgagatt 


540 


tttgacrctc 


tggtaaagca 


gacccacgtt 


600 


gctggcttcc 


ccctcaacca 


gtctgaagtg 


660 


ggaggtatcg 


accacCcgct 


gtacacaggc 


720 


tggtatcarg 


aggccaccat 


ccj'gcgggcg 


780 


tgcaaggagt 


acaactatga 


caagagcazt 


840 


cccaagaaag 


tgct^gaagc 


tgcagtcaaa 


900 


Ctccctgatg 


gttcctggct 


aggagagcag 


960 


tggaacattt 


tcccagtcat 


ctcactctac 


1020 


cgcatcacca 


tccttccgca 


gcaatacctg 


lOBO 


gacgactgtt 


acaagtctgc 


catctcacag 


1140 
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ccatccacgg gcactgcrac gggagctgct accacggagg gct^cracgt cgtcrctgat 12 OC 

cgggcccgaa aicgaattgg ctttgccg::c agcgcttgcc acgtgcacga tgagctcagg 126C 

acggcagcgg tggaaggccc ttttgccacc ctggacacgg aagaccgtgg ctacaacatt 13 2C 

ccacagacag atgagccatg a 1341 

<210> 22 

<211> 446 

<212> PRT 

<213> Homo sapiens 

<400> 22 

Kec Ala ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly ser Thr Glr 
15 10 15 

His Gly lie Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu 
20 25 30 

Gly Leu Arg Leu Pro Arg Glu Tbr Asp Glu Glu Pro Glu Glu Pro Gly 
35 40 45 

Arg Arg Gly Ser Phe Val Glu Met Val Asp Asn Leu Arg Gly Lys Ser 
50 55 60 

Gly Gin Gly Tyr Tyr Val Glu Met Thr Val Gly Ser Pre Pro Gin Thr 
65 70 75 80 

Leu Asn lie heu Val Asp Thr Gly ser ser Asn Phe Ala Val Gly Ala 
85 90 95 

Ala Pro Kis Pro Phe Leu His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser 
100 ■ 105 110 
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Thr Tyr Arg Asp Leu Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly 
115 120 125 



Lys Trp alu Gly Glu Leu Gly Thr Asp Leu Val Ser lie Pro His Gly 
130 13S 140 



Pro Asn Val Thr Val Arg Ala Asn lie Ala Ala lie Thr Glu Ser Asp 
145 150 155 16C 



Lys Phe Phe He Asn Gly Ser Asn Trp Glu Gly lie Leu Gly Leu Ala 
165 17C 175 



lyr Ala Glu He Ala Arg Pro Asp Asp Ser Leu Glu Pro Phe Phe Asp 
180 IBS 190 



Ser Leu Val Lys Gin Thr His Val Pro Asn Leu Phe Ser Leu His Leu 
195 200 205 



Cys Gly Ala Gly Phe Pre Leu Asn Gin Ser Glu Val Leu Ala Ser Val 
210 215 22D 



Gly Gly Ser Met He He Gly Gly He Asp His Ser Leu Tyr Thr Gly 
225 230 235 240 



Ser Leu Trp Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He 
245 250 255 



He Val Arg Val Glu He Asn Gly Gin Asp Leu Lys Met Asp cys Lys 
260 265 270 
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Glu Tyr Asn Tyr Asp Lys Ser He Val Asp Sar Gly Thr Thr Asn Leu' 
275 • 28C 285 



Arg Leu Pro Lys Lys Val Phe Glu Ala Ala Val Lys ser lie Lys Ala 
290 295 300 



Ala Ser Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin 
305 310 315 320 



Leu Val Cys Trp Gin Ala Gly Tlir Thr Pro Trp Asn He Phe Pro Val 
325 330 33S 



lie Ser Leu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg lie 
340 34S 350 



■Ehr lie Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr 
355 360 565 



Ser Gin Asp Asp Cys Tyr Lys Phe Ala He Ser Gin ser Ser Thr Gly 
373 375 380 



Thr Val Met Gly Ala Val He Met Glu Gly Phe Tyr Val Val Phe Asp 
385 390 395 400 



Arg Ala Arg Lya Arg He Gly Phe Ala Val Ser Ala Cys His Val His 
405 410 415 



Asp Glu Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp 
420 425 430 



Met Glu Asp Cys Gly Tyr Asn He Pro Gin Thr Asp Glu Ser 
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43S 440 445 

<210> 22 

<211> 1380 

<212> DNA 

<213> Homo sapiens 



<400> 23 



atggctagca 


tgactggtgg 


acagcaaatg 


ggCcgcggat 








ccgccf tgsi&c 


agga cgga t c 




ggcatccggc 


tgcccccgcg 


cagcggcc^g 


120 


gS3ff9cgccc 


ccctggggct 


gcggctgccc 


cgggagaccg 


acgaagagcc 


cgagg age c c 


180 


gg c c ggziggg 


gcagc tccgc 


ggag a tgg t g 






gggg cagg gc 


240 


'CactacgCgg 


agaLgaccgt 


gggcagcccc 


ccgcagacgc 


tcaacatcct 


gg"C gga t aca 


300 


ggcagcagta 


actttgcagt 


gggtgctgcc 


ccccacccct 


tcctgcatcg 


ctactaccag 


360 


aggcagctgt 


ccagcacata 


ccgggacctc 


cggaagggtg 


tgcatgtgcc 


ctacacccag 


420 


ggcaagtggg 


aaggggagct 


gggcaccgac 


ctggtaagca 


tcccccatgg 


ccccaacgcc 


480 


actgtgcgtg 


ccaacattgc 


tgccatcacc 


gaatcagaca 


agttcctcat 


caacggctcc 


540 


aactgggaag 


gcatcctggg 


gctggcctat 


gctgagattg 


ccaggcctga 


cgactccctg 


600 


gagcccttct 


ttgactctct 


ggtaaagcag 


acccacgttc 


ccaacctctt 


ctccctgcac 


660 


ctttgcggtg 


ctggcttccc; 


cctcaaccag 


tctgaagtgc 


tggcctctgt 


cggagggagc 


720 


atgatcattg 


gaggcatcga 


ccactegctg 


tacacaggca 


gtctctggta 


tacacccatc 


780 


cggcgggagt 


ggtattatga 


ggtcatcact 


gcgcgggtgg 


agatcaatgg 


acaggatccg 


840 


aaaaCggact 


geaaggagta 


caactatgac 


aagagcattg 


tggacagtgg 


caccaccaac 


900 


ctccgtttgc 


ccaagaaagt 


gtttgaagct 


gcagtcaaat 


ccatcaaggc 


agccccctcc 


960 


acggagaagt 


tccctgatgg 


tttctggcta 


ggagagcagc 


rggtgtgctg 


gcaagcaggc 


1020 


accacccctc 


ggaacatcct 


cccagccatc 


ccactccacc 


taatgggcga 


ggtLaccaac 


1080 


cagtccttcc 


gcatcaccat 


ccttccgcag 


caaCacctgc 


ggccagtgga 


agacgi:ggcc 


1140 


acgtcccaag 


acgactgLLa 


caagtttgcc 


atctcacagt 


catccacggg 


cactgttatg 


1200 


ggagctgtta 


-catggaggg 


cttctacgtr 


gtctttgatc 


gggcccgaaa 


acgaattggc 


1260 


tttgctgtca 


gcgct::gcca 


tgtgcacgat 


gagttcagga 


cggcagcggt 


ggaaggccct 


1320 



59 



WO0<VI7369 



PCT/US99/2088I 



trtgtcacct tggacatgga agactgtggc casaacattc cacagacaga tgagtcaiga 13 80 

<210> 24 

<211> 459 

<212> PRT 

<213> Homo sapiens 

<400> 24 

Met Ala Ser Met Thr Sly Gly Gin Gin Met Gly Arg Gly Ser Met Thr 
15 10 15 

lie Ser Acp Ser Pro Arg Glu Gin Asp Gly Ser Thr Gin His Gly lie 
20 25 30 

Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly Ala Pro Leu Gly Leu Arg 
35 40 45 

Leu Pro Arg Glu Thr Asp Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly 

50 55 60 

Ser Phe Val Glu Met Val Asp Asn Leu Arg Gly Lys Ser Giy Gin Gly 
65 70 75 8C 

Tyr Tyr Val Glu Met Tlir Val Gly Ser Pro Pro Gin Thr Leu Asn lie 
as 90 95 

Leu Val Asp Thr Gly ser Ser Asn Phe Ala Val Gly Ala Ala Pro His 
100 ■ 10[> 110 

Pro Phe Leu His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg 
115 120 125 
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Asp Leu Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu 
130 135 140 

Gly Glu Leu Gly Thr Asp Leu Val Ser lie Pro His Gly Pro Asn Val 
145 150 155 160 

Thr Val Arg Ala Asn lie Ala Ala lie Thr Glu Ser Asp Lys Phe Phe 
165 170 175 

lie Asn Giy ser Asr Trp Glu Gly lie Leu Gly Leu Ala Tyr Ala Glu 
180 185 190 

lie Ala Arg Pro Asp Asp Ser Leu Glu Pro Pbe Phe Asp Ser Leu Val 
195 200 205 

Lys Gin Thr His Val Pro Ash Leu Phe Ser Leu His Leu Cys Gly Ala 
210 215 220 

Gly Phe Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly Ser 
225 230 235 240 

Met lie lie Gly Gly He Asp His Ser Leu Tyr Thr Gly Ser Leu Trp 
245 250 255 

Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr Glu Val He He Val Arg 
260 265 270 



Val Glu He Asn Gly Gin Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn 
275 280 285 
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Tyr Asp Lys Ser Tie Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro 
290 ' 29S 3O0 



Lys Lys Val Phe Glu Ala Ala Vai Lys Ser He Lys Ala Ala Ser Ser 
305 310 315 320 



Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val Cys 
325 330 335 



Trp Gin Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser Leu 
340 345 350 



Tyr Leu Met Gly Glu Val Thr Asn Gin Ser Phe Arg He Thr He Leu 
355 360 365 



Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp Val Ala Thr Ser Gin Asp 
370 375 380 



Asp Cys Tyr Lys Phe Ala He Ser Gin Ser Ser Thr Gly Thr Val Met 
38B 390 395 400 



Gly Ala Val He Met Glu Gly Phe Tyr Val Val Phe Asp Arg A:a Arg 
405 410 415 



Lys Arg He Gly Phe Ala Val Ser Ala cys His Val His Asp Glu Phe 
420 425 430 



Arg Thr Ala Ala Val Glu Gly Pro Phe Val Thr Leu Asp Met Glu Asp 
435 440 445 



Cys Gly Tyr Asn He Pro Gin Thr Asp Glu Ser 
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450 455 

<210> 25 

<211> 1302 

<212> DNA 

<213> Mono sapiens 

<400> 25 

atgactcage atggCactcg tctgccactg 
cLgcgCctgc cccgggagac cgacgaagag 
g-ggagatgg tggacaacct gaggggcaag 
gtgggcagcc ccccgcagac gctcaacatc 
gtgggtgctg ccccccaccc ctccctgcat 
taccgggacc tccggaaggg tgtgtatgtg 
ctgggcaccg acctggtaag catcccccat 
gctgccatca ctgaatcaga caagttcttc 
gggctggcct atgctgagat tgccaggcct 
ctggtaaagc agacccacgt tcccaacctc 
ccccicaacc agtctgeiagt gctggcctct 
gaccactcgc tgtacacagg cagtctctgg 
gaggtcatca ttgtgcgggt ggagatcaat 
tacaactazg acaagagcat tgtggacagt 
gtgtttgaag ctgcagccaa atccatcaag 
ggtttctggc taggagagco gctggtgcgc 
ttcccagtca tctcactcta cctaatgggt 
atccttccgc agcaatacct gcggccagtg 
tacaagtttg ccatctcaca gtcatccacg 
ggcttctacg ttgtctttga tcgggcccga 
catgtgcacg acgagtccag gacggcagcg 
gaagactgtg gctacaacat tccacagaca 



cgtageggtc 


cgggtggtgc 


tccactgggc 


60 


cccgaggagc 


ccggccggag 


gggcagcttt 


120 


-cggggcagg 


gctactacgc 


ggagatgacc 


180 


ctggtggata 


caggcagcag 


taacttcgca 


240 


cgctactacc 


agaggcagct 


gtccagcaca 


300 


ccctacaccc 


agggcaagtg 


ggaaggggag 


360 


ggccccsiacg 


tcactgtgcg 


tgccaacact 


420 


atcaacggcc 


ccaactggga 


aggcatcctg 


480 


gacgactccc 


tggagccttt 


ctttgactct 


540 


ttccccctg= 


acctttgtgg 


tgctggcttc 


600 


gtcggaggga 


gcatgatcat 


tggaggtatc 


660 


tatacaccca 


tccggcggga gtggtattat 


720 


ggacaggatc 


tgaasiacgga 


ctgcaaggag 


780 


ggcaccacca 


accttcgttt 


gcccaagaaa 


840 


gcagcctcct 


ccacggagaa 


gttccctgat 


900 


tggcaagcag 


gcaccacccc 


t tggaacat t 


960 


gaggttacca 


accagtcctt 


ccgcaccacc 


1020 


gaagatgtgg 


ccacgtccca 


agacgactgt 


1080 


ggcaccgtta 


tgggagctgt 


tatcatggag 


1140 


aaacgauictg 


gctttgctgt 


cagcgcttgc 


1200 


gtggaaggcc 


cttttgccac 


sttggacatg 


1260 


gat gag teat 


ga 




1302 
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<210> 26 
<211> 433 
<212> PRT 

<213> Homo sapiens 
<4Q0> 26 

Met Thr Gin His Gly lie Arg Leu Pro Leu Arg Ser Gly Leu Gly Gly 
15 10 15 

Ala Pro Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp Glu Glu Pro Glu 
20 25 30 

Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val Asp Asn Leu Arg 



Gly Lys Ser Gly Gin Gly Tyr Tyr Val Glu Met Thr Val Gly Ser Pro 

50 55 60 



Pro Gin Thr Leu Asn He Leu Val Asp Thr Gly Ser Ser Asn Phe Ala 
65 70 75 80 



Val Gly Ala Ala Pro His pro Phe Leu His Arg Tyr lyr Gin Arg Gin 
85 90 95 



Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val Tyr Val Pro Tyr 
100 105 110 



Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp Leu Val Ser He 
115 120 125 
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Pro His Gly Pro Asn Val Thr Val Arg Ala Asn lie Ala Ala lie Thr 
130 ' 135 140 

Glu Ser Asp jys Phe Phe He Asr. Gly Ser Asn Trp Glu Gly He Leu 
145 150 15S 160 

Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp Ser Leu Glu Pro 
165 170 175 

Phe Phe Asp Ser I>eu Val Lys Gin Thr His Val Pro Asn Leu Phe Ser 
180 185 190 

Leu His Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin Ser Glu Val Leu 
195 200 205 

Ala Ser Val Gly Gly Ser Met He He Gly Gly He Asp His Ser Leu 
210 215 220 

Tyr Thr Gly Ser Leu Trp Tyr Thr Pro He Arg Arg Glu Trp Tyr Tyr 
225 230 235 240 

Glu val He He val Arg Val Glu He Asn Gly Gin Asp Leu Lys Met 
245 250 255 

Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val Asp Ser Gly Thr 
260 265 270 

Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala Ala val Lys Ser 
275 280 285 

He Lys Ala Ala Ser Ser Thr Glu Lys Phe Pro Asp Gly Phe Trp Leu 
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Sly Glu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr Pro Trp Asn lie 
305 310 315 320 



Phe Pro Val lie Ser beu Tyr Leu Met Gly Glu Val Thr Asn Gin Ser 
325 330 335 



Phe Arg lie Thr lie Leu Pro Gin Gin Tyr Leu Arg Pro Val Glu Asp 
340 345 350 



Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala lie Ser Gin Ser 
355 360 365 



Sor Thr Gly Thr Val Met Gly Ala Val lie Met Glu Gly Phe Tyr Val 
370 375 380 



Val Phe Asp Arg Ala Arg Lys Arg lie Gly Phe Ala val Ser Ala Cys 
385 390 395 400 



His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu Gly Pro Phe Val 
405 410 415 



Thr Leu Asp Met Glu Asp Cys Gly Tyr Asn lie Pro Gin Thr Asp Glu 
420 425 430 
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<211> 1278 
<212> DNA 
<213> Homo sapiens 

<400> 27 

atggctagca cgactggtgg acagcaaatg 
ccgctggact ctggtatcga aaccgacgga 
ggcaagtcgg ggcagggcta ctacgtggag 
aacazcstgg tggatacagg cagcagcaac 
ctgcatcgct actaccagag gcagctgtcc 
tatgt:gccct acacccaggg caagcgggaa 
ccccatggcc ccaacgtcac tgtgcgtgcc 
ttcttcatca acggctecaa ctgggaaggc 
aggcctgacg actccctgga gccttccttt 
aacctsttct ccccgcacct tcgtggtgct 
gcctctgtcg gagggagcat gatcattgga 
ctctggtata cacccatccg gcgggagtgg 
atcaatggac aggatctgaa aatggactgc 
gacagtggca ccaccaacct rcgtttgccc 
atcaaggcag cctcccccac ggag2W!.gttc 
gtgtgctggc aagcaggcac caccccttgg 
atgggtgagg tcaccaacca gtccctccgc 
ccagtggsuig atgtggccac gtcccaagac 
tccacgggca ccgttacggg agccgccarc 
gcccgziaaac gaattggctt tgctgtcagc 
gcagcggtgg aaggcccttt tgtcaccttg 
cagacagatg agtcatga 

<210> 28 
<211> 425 
<212> PRT 



ggtcgcggat cgatgactat ctctgact::t 60 
tcctttgtgg agatggtgga caacctgagg 120 
atgaccgtgg gcagcccccc gcagacgctc 180 
tttgcagtgg gtgccgcccc ccacccctcc 240 
agcacatacc gggacctccg geiagggtgtg 300 
ggggagctgg gcaccgacct ggtaagcatc 360 
aacattgctg ccaccaccga atcagacaag 420 
atcctggggc tggcctatgc ::gagattgcc 480 
gactctctgg taaagcagac ccacgttccc 54 0 
ggcttccccc tcaaccagtc tgaagtgctg 600 
ggtatcgacc actcgcrgta cacaggcagt 6fiO 
tattatgagg tcatcattgt gcgggtggag 720 
aaggagtaca actatgaceia gagcattgtg 780 
aagaaagtgt tLgaagctgc agtcaaatcc 840 
cctgatggtt tctggctagg agagcagctg 900 
aacattttcc cagtcatctc actctaccta 960 
atcaccatcc ttccgcagca atacccgcgg 1020 
gaccgtcaca agtttgccat ctcacagtca 1080 
acggagggct tctacgttgt ctctgatcgg 114 0 
gcttgccatg tgcacgatga gttcaggacg 1200 
gacatggaag actgrggcta caacattcca 1260 

1278 
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•<212> Homo sapiens 
<400> 28 

Met Aia Ser Met Thr Gly Gly Gin Gin Met Gly Arg Gly Ser Met Thr 
15 10 15 

lie Ser Asp Ser Pro IjQU Asp Ser Gly He Glu Thr Asp Gly Ser Phe 
20 25 30 

val Glu Met val Asp Asn Leu Arg Gly Lys ser Gly Gin Gly Tyr lyr 
35 40 45 

val Glu Met Thr Val Gly Ser Pro Pro Gin Tht Leu Asn He Leu Val 
50 55 60 

Asp Thr Gly Ser Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe 
65 70 75 80 

Leu His Arg Tyr Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu 



Arg Lys Gly Val Tyr Val Pro Tyr Thr Gin Gly Lys Trp Glu Gly Glu 
100 105 110 



Leu Gly Tnr Asp Leu Val Ser He Pro His Gly Pro Asn Val Thr Val 
115 120 125 



Arg Ala Asn lie Ala Ala lie Thr Glu Ser Asp Lys Phe Phe He Asn 
130 135 140 



Gly Ser Asn Trp Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala 
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145 153 1S5 160 

Arg l>ro Asp Asp Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin 
165 170 175 

Thr His Val Pro Asn Leu Phe Sex Leu His Leu Cys Gly Ala Gly Phe 
180 185 190 

Pro Leu Asn Gin Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He 
195 200 20^ 

lie Gly Gly He Asp His ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr 
21C 215 220 

Pro He Arg Arg Giu Trp Tyr Tyr Glu Val He He Val Arg Val Glu 
225 230 235 240 

He Asn Gly Glr. Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp 
245 250 255 

Lys Ser He Val Asp Ser Gly Thr Thr Asn Leu Arg Leu Pro Lys Lys 
260 26S 270 

Val Phe Glu Ala Ala val Lys ser He Lys Ala Ala ser Ser Thr Glu 
275 280 285 

Lys Phe Pro Asp Gly Phe Trp Leu Gly Glu Gin Leu Val Cys Trp Gin 
290 295 300 

Ala Gly Thr Thr Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu 
305 ~ 310 315 320 
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H6Z Gly Glu Val Thr Asn Gin Ser 
325 

Gin Tyr Leu Arg Pro Val Glu Asp 
340 

Tyr Lys Phe Ala He Ser Gin Ser 

355 360 

Val He Ket Glu Gly Phe Tyr Val 
370 375 

He Gly Phe Ala Val Ser Ala Cys 
385 390 

Ala Ala val Glu Gly Pro Phe Val 
405 

Tyr Asn He Pro Gin Thr Asp Glu 
420 



Phe Arg He Thr He Leu Pro Gin 
330 335 

Val Ala Thr Ser Gin Asp Asp Cys 
345 350 

Ser Thr Gly Thr Val Met Gly Ala 
365 

Val Phe Asp Arg Ala Arg Lys Arg 
380 

His Val His Asp Glu Phe Arg Thr 
395 400 

Thr Leu Asp Het Glu Asp Cys Gly 
4i0 415 

Ser 
425 



<210> 29 

<211> 1362 

<212> DNA 

<213> Homo sapieos 

<400> 29 

atggcccaag ccctgccctg gctcctgctg tggacgggcg cgggagtgct gcctgcccac 60 
ggcacccagc acggcatccg gctgccccLg cqcagcggcc tggggggcgc ccccctgggg 120 
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ctgcggcrgc 


eccgggagac 


cgacgaagag 


cccgaggagc 


ccggscggag gggcagcCtt 


18C 


gtggagatgg 


t^gacaaccc 


gaggggcaag 


tcggggcagg 


gcia^CaccT^ 


ggacfaccacc 


240 


gtgggcagcc 


ccccgcagac 


gcccaacacc 


ctggcggaca 


caggcagcag 


caacccLgca 


300 


gtgggtgctg 


ccccccacrc 


cttcctgcat 


cgctactacc 


agaggcagct 


gtccagcaca 


360 


Caccgggacc 


tccggaaggg 


tgtgtatgtg 


ccctacaccc 


agggcaagcg 


ggaaggggag 


420 


ctgggcaccg 


acctggraag 


catcccccat 


ggccccaacg 


tcactgtgcg 


tgccaacatt 


480 


getgccacca 


crgaatcaga 


caagtccctc 


atcaacggct 


ccaactggga 


aggcaccccg 


540 


gggccggcct 


ai.gctgagat 


tgccaggcct 


gacgactccc 


tggagccttt 


ctttgactct 


600 


ctggtaaagc 


agacccacgt 


tcccaacctc 


ttctcccrgc 


acctttctgg 


tgccggcttc 


€60 


cccctcaacc 


agcctgaagc 


gctggccccc 


gtcggaggga 


gcatgaccat 


tggaggtatc 


720 


gaccactcgc 


tgtacacagg 


cagtctctgg 


tatacaccca 


tccggcggga gtggtattat 


780 


gaggtcatca 


rcgtgcgggt 


ggagatcaac 


ggacaggacc 


rgaaaatgga 


ctgcaaggag 


840 


tacaactatg 


acaagagca' 


tgtggacagt 


ggcaccacca 


accttcgttt 


gcccaagaaa 


900 


gtgtttgaag 


ctgcagccaa 


atccatcaag 


gcagccccct 


ccacggagaa 


gttcccrgat 


960 


ggtttczggc 


taggagagca 


gctggtgtgc 


tggcaagcag 


gcaccacccc 


ttggaacat t 


1020 


ttcccagtca 


tctcactcca 


cctaatgggt 


gaggttacca 


accagtcctt 


ccgcatcacc 


1080 


atccccccgc 


agcaataccc 


gcggccagtg 


gaagatgtgg 


ccacgtccca agacgactgt 


1140 


tacaagtttg 


ccatcccaca 


gtcatccacg 


ggcacrgtta 


tgggagcLgL 


Latcatggag 


1200 


ggcttcLacg 


ttgtctttqa 


Lcgggcccga 


aaacgaattg 


gctttgctgt 


cagcgcttgc 


1260 


catgtgcacg 


atgagttcag 


gacggcagcg 


gtggaaggcc 


ctrttgtcac 


cttggacatg 


1320 


gaagactgtg 


gctacaacac 


tccacagaca 


gatgagtca- 






1362 



<210> 30 
<211> 453 
<212> PRT 

<213> Homo sapiens 
<400> 30 

Met Ala Gin Ala Leu Pro Trp Leu Leu Lea Trp Met Gly Ala Gly Val 
15 10 15 
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Leu Pre Ale His Gly Thr Gin His Gly lie Arg Leu Pro Leu Arg Ser 

• 20 25 30 

Gly Leu Gly Gly Ala Pro Leu Giy Leu Arg Leu Pro Arg Glu Thr Asp 
35 40 45 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys sar Gly Gin Gly Tyr Tyr Val Glu Mer Thr 
65 73 75 80 

Val Gly Ser Pro Pro Gin Thr Leu Asn lie Leu Val Asp Thr Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pre His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu Ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pre Tyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
13C 135 140 

Leu Val Ser lie Pro His Gly Pro Asn Val Thr Val Arg Ala Asn lie 
145 ISO 155 160 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
165 170 175 

Glu Gly He Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pre Asp Asp 
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Ser Leu Glu Pro Phe Phe Asp Ser Leu Val Lys Gin 'Thr His Val Pro 
19S 200 205 



Asn Leu Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 



Ser Glu Val Leu Ala Ser Val Gly Gly Ser Met He He Gly Gly He 
225 230 23b 240 



Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Tiir Pro He Arg Arg 
245 * 250 255 



Glu Trp Tyr Tyr Glu Val He He Val Arg Val Glu He Asn Gly Gin 
260 265 270 



Asp Leu Lys Met Asp Cys Lys Glu Tyr Asn Tyr Asp Lys Ser He Val 
275 280 285 



Asp Ser Giy Thr Thr Asn Leu Arg Leu Pro Lys Lys Val Phe Glu Ala 
290 295 300 



Ala Val Lys Ser He Lys Ala Ala ser Sor Thr Glu Lys Phe Pro Asp 
3C5 310 315 320 



Gly Phe Trp Leu Giy Glu Gin Leu Val Cys Trp Gin Ala Giy T.ir Thr 
325 330 335 



Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
340 ~ 345 350 



73 



wo 00/17369 



PCT/US99/2088I 



Thr Asn Gin 'Ser Phe Arg lie Thr lie Le- Pro Gin Gin Tyr Leu Arg 
355 360 365 

Pro Val Glu Asp Val Ala Ttir Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 38C 

lie Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val lie Met Glu 
3B6 390 395 400 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg He Gly Phe Ala 
405 410 415 

Val Ser Ala Cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 

Gly Pro PhG Val Thr Lea Asp Met Glu Asp Cys Gly Tyr Asn He Pro 
435 440 445 

Glu Thr Asp Glu Ser 
450 



<210> 31 

<211> 1380 

<212> DNA 

<213> Homo sapiens 

<400> 31 

aiggcccaag ccctgccccg gctcctgctg tggatgggcg cgggagtgct gcctgcccac 60 
ggcacccagc acggcatccg gctgccccCg cgcagcggcc tggggggcgc ccccctgggg 120 
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ccgcggccgc 


cccgggagac 


cgacgaagag 


cccgaggagc 


ccggccggag 


gggcagctt:: 


180 


gtggagacgg 


tggacaacct 


gaggggcaag 


tcggggcagg 


gctactacgt 


ggagatgacc 


240 


gtgggcagcc 


ccccgcagac 


gctcaacatc 


ctggtggata 


caggcagcag 


caactccgca 


300 


gtgggcgctg 


ccccccaccc 


ctccccgcaw 


cgctaccacc 


agaggcagst 


gtccagcaca 


360 


caccgggacc 


tccggaaggg 


cgcgcatgtg 


ccctacaccc 


agggcaagtg 


ggaaggggag 


420 


ctgggcaccg 


acctggtaag 


catcccccat 


ggccccaacg 


tcactgcgcg 


tgccaacatt 


480 


gccgccatca 


ctgaatcaga 


caagctcttc 


atcaacggct 


ccaactggga 


aggcatcctg 


540 


gggctggcct 


atgctgagat 


"gccaggcct 


gacgactccc 


tggagccttt 


ctttgactct 


600 


ctggtaaagc 


agacccacgt 


ccccaacctc 


tcctccctgc 


acctttgtgg 


tgctggcttc 


660 


ccccccaacc 


agcccgaagt 


gccggcctct 


gtcggaggga 


gcatgatcac 


tggaggtatc 


720 


gaccactcgc 


tgtacacagg 


cagtctctgg 


Catacaccca 


tccggcggga 


gtggtattat 


780 


gaggccatca 


ttgtgcgggt 


ggagatcaaC 


ggacaggatc 


rgaaaatgga 


cCgcaaggag 


840 


tacaacCatg 


acaagagcat 


tgtggacagc 


ggcaccacca 


accttcgctt 


gcccaagaaa 


900 


gtgtctgaag 


ctgcagtcaa 


acccatcaag 


gcagccccct 


ccacgcagaa 


gttccctgat 


960 


ggtctctggc 


taggagagca 


gctggtgtgc 


Cggcaagcag 


gcaccacccc 


LtggaacatC 


1020 


ttcccagrca 


tctcactcta 


cctaatgggt 


gaggttacca 


accagtcctt 


ccgcatcacc 


1080 


atccttccgc 


agcaatacct 


gcggccagtg 


gaagatgtgg 


ccacgtccca 


agacgactgt 


114 0 


tacaagtrtg 


ccatctcaca 


gtcatccacg 


ggcactgtta 


tgggagctgt 


tatcatggag 


1200 


ggcttctacg 


tcgtctttga 


tcgggcccga 


aaacgaattg 


gctttgctgt 


cagcgcttgc 


1260 


catgtgcacg 


atgagztcag 


gacggcagcg 


gtggaaggcc 


cttttgtcac 


cttggacatg 


1320 


gaagactgtg 


gctacaacat 


tccacagaca 


gatgagtcac 


agcagcagca 


gcagcagtga 


1380 



<210> 32 
<211> 439 
<212> ?RT 

<213> Homo sapiens 
<400> 32 

Met Ala Gin Ala Leu Pro Trp Leu Leu Leu Trp Met Giy Ala Gly Val 
15 10 15 
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Leu Pro Ala His Gly Thr Gin His Gly lie Arg Leu Pro Leu Arg Ser 
•20 25 30 

Gly Leu Gly Gly Ala Pre Leu Gly Leu Arg Leu Pro Arg Glu Thr Asp 
2S 40 4S 

Glu Glu Pro Glu Glu Pro Gly Arg Arg Gly Ser Phe Val Glu Met Val 
50 55 60 

Asp Asn Leu Arg Gly Lys Ser Gly Gla Gly Tyr Tyr Val Glu Met Thr 
65 70 75 80 

Val Cly Ser Pro Pro Gin Thr Leu Asn He Leu Val Asp Thr Gly Ser 
85 90 95 

Ser Asn Phe Ala Val Gly Ala Ala Pro His Pro Phe Leu His Arg Tyr 
100 105 110 

Tyr Gin Arg Gin Leu ser Ser Thr Tyr Arg Asp Leu Arg Lys Gly Val 
115 120 125 

Tyr Val Pro lyr Thr Gin Gly Lys Trp Glu Gly Glu Leu Gly Thr Asp 
130 135 140 

Leu Val Ser He Pro His Gly Pro Asn Val Thr Val Arg Ala Asn He 
145 150 155 160 

Ala Ala He Thr Glu Ser Asp Lys Phe Phe He Asn Gly Ser Asn Trp 
155 170 175 



Glu Gly lie Leu Gly Leu Ala Tyr Ala Glu He Ala Arg Pro Asp Asp 
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ser LeJ Glu Pro Phe Phe Asp Ser Leu Val Lys Gin Thr His Vai Pro 
195 200 205 



Asn Leu Phe Ser Leu Gin Leu Cys Gly Ala Gly Phe Pro Leu Asn Gin 
210 215 220 



Ser Glu Val Leu Ala Ser Val Gly Gly Ser Ket lie lie Gly Gly lie 
22b 230 23b 240 



Asp His Ser Leu Tyr Thr Gly Ser Leu Trp Tyr Thr Pro lie Arg Arg 
245 250 255 



Glu Trp Tyr Tyr Glu Val He He 
260 

Asp Leu Lys Met Asp Cys Lys Glu 
275 280 

Asp Ser Gly Thr Thr Asn Leu Arg 
290 295 

Ala Val Lys Ser lie Lys Ma Ala 
305 310 



Val Arg Val Glu He Asn Gly Gin 
265 270 

Tyr Asn Tyr Asp Lys Ser He Val 
285 

Leu Pro Lys Lys Val Phe Glu Ala 
300 

Ser Scr Thr Giu Lys Phe Pro Asp 

315 320 



Gly Phe Trp Leu Gly Giu Gin Leu Val Cys Trp Gin Ala Gly Thr Thr 
325 330 335 



Pro Trp Asn He Phe Pro Val He Ser Leu Tyr Leu Met Gly Glu Val 
340 " 345 350 
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Thr Aan Gin 'Ser Phe Arg lie Thr lie Leu pro Gin Gin Tyr Leu Arg 
355 ^ 360 365 

Pro Val Glu Asp Val Ala Thr Ser Gin Asp Asp Cys Tyr Lys Phe Ala 
370 375 380 

lie Ser Gin Ser Ser Thr Gly Thr Val Met Gly Ala Val lie Met Glu 
385 390 395 400 

Gly Phe Tyr Val Val Phe Asp Arg Ala Arg Lys Arg lie Gly Phe Ala 
405 410 415 

val Ser Ala cys His Val His Asp Glu Phe Arg Thr Ala Ala Val Glu 
420 425 430 

Gly Pro Phe Val Thr Leu Asp Met Glu Asp Cys Gly lyr Asn lie Pro 
43b 440 445 

Gin Thr Asp Glu Ser His His His His His His 
450 455 

<210> 33 

<211> 25 

<212> PRT 

<213> Koiao sapiens 

<400> 33 

Ser Glu Gin Gin Arg Arg Pro Arg Asp Pro Glu Val Val Asn Asp Glu 
1 5 10 15 



78 



wo OO/l 7369 PCT/US99/2088 1 



Ser Ser Leu Val Arg His Arg Trp Lys 



<210> 34 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Ser Glu Gin Leu Arg Sin Gin His Asp Asp Phe Ala Asp Asp He Ser 
15 10 15 

Leu Leu Lys 



<210> 35 
<211> 29 
<212> DNA 

<213> Horns sapiens 
<400> 35 

gtggatccac ccagcacggc atccggccg 

<210> 36 

<211> 36 

<212> DNA 

<213> Homo sapiens 
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<400> 36 



gaaagcCLLc 



atgacLcatc tgcctgtgga acgttg 



<210> 37 

<211> 39 

<212> DNA 

<213> Homo sapiens 

<400> 37 

gatcgacgac catctccgac tctccgcgtg aacaggacg 39 

<210> 38 

<211> 39 

<212> DNA 

<213> Homo sapiens 



<210> 39 
<211> 77 
<212> DKA 

<213> Artificial Sequence 
<22Q> 

<223> Description of Artificial Sequence: Hu-Asp2 
<403> 39 

cggcatccgg ctgcccctgc gtagcggcct gggtggtgct ccactgggtc tgcgtctgcc 60 
ccgggagasc gacgaag 77 



<400> 38 



gatccgtcct 



gttcacgcgg agagtcagag atagtcatc 



39 
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<210> 40 
<211> 77 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Oescriptior. of Artificial Sequence: Hu~Asp2 
<4O0> 40 

ct-cgtcggc ctcccggggc agacgcagac ccagtggagc accacccaga ccgctacgca 60 
ggggcagccg gacgccg 77 

<210> 41 
<211> 51 
<212> DNR 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: CEispase 8 
Cleavage site 

<400> 41 

gatcgatqac tatctctgac tctccgctgg actctggtat cgaaaccgac g 51 

<210> 42 
<211> 51 
<212> ENA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: Caspasc 8 
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Cleavage Site 
<400> 42 

gaLccgCcgg tttcgatacc agagcccagc ggagagrcag agatagccat c 51 



<210> 43 

<211> 32 

<212> DNA 

^213> Homo sapiens 

<400> 43 

aaggaicctc cgtggagatg gtggacaacc tg 

<2i0> 44 

<211> 36 

<212> DNA 

<213> Homo sapiens 



32 



<400> 44 

gaaagctttc atgactcatc tgtctgtgga atgctg 3 6 

<210> 45 
<211> 24 
<212> DNA 

•c213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 6-His tag 
<400> 45 

garcgcatca tcaccatcac catg 24 
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<210> 46 
<211> 2A 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence: S-His tag 
<400> 46 

gatccatggt gatggtgatg acgc 24 

<210> 47 
<211> 354 
<212> ONA 

<213> Artificial Sequence 
<220> 

•<:223> Description of Artificial sequence: Introduce KK 
motif 

<40C> 47 

bbttaanvtt nnnnngactg accactcgac caggttcbnr laaonhadata ragrahntsn 60 

ayrsksOsna yrtawsddcg tmsnvfrmans ymbarahrOg Hctgaccact cgaccaggtt 120 

csnayrsnay rhOdtgactg accactcgac caggttcacL snayrctcsn asnanxmadt 180 

csnayrtcna mcrstwrdOt dthharmaca hngactgacc actcgaccag gttcctdgda 240 

nObdOcdaOO aOcaOrtntr ygtabwrddc mntsnimaryn nnatndcmnt sinnarynnna 300 

tnsksOycihb abctrhvgrr ccrOrsmcrs twrddcmntm swrddcwrdd cinnt 354 

<210> 48 
<211> 462 
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<212> DKA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Introduce KK 
motif 

•c400> 46 

bbttaanttn nnntaicgaat taaattccag cacactggct acttcttgtt ctgcatctca 60 
aagaacbnnn actnhadatar agrahntsna yrsksOsnay rtawsddcgt asnwrmansy 120 
mbarahrOcg aattaaattc cagcacactg gctactcctt gttctgcatc tcaaagaacs 180 
nayrsnayrh Dhtcgaatta aactccagca cactggctac ttcttgttct gcatctcaaa 240 
gaacgaasna yrttcsnasn anrnadtcsn ayrtcnamcr stwrdOcgXs kdliharmaca 300 
hncgaattaa attccagcac actggctact tcttgttctg catctcaaag aacttdgdan 360 
ObOcdaOOaO caOrtntryh kktobwrddc mntsirmaryn rmatndcmnt smmarynniia 420 
tntdccmbbc tckkmcrstw rddonntmsw rddcwrddcm nt 462 

<210> 49 
<211> 380 
<2I2> DNA 

<213> Artificial sequence 
c220> 

<223> Description of Artificial Sequence: Introduce KK 
motif 

<400> 49 

bbttaanttn nnnnmcgaat taaattccag cacactggct abnrmacmha dataragrah 60 
ntsnayrsks Osnayrtaws ddcgcmsnwr mansymbara brOcgaatta aattccagca 120 
cactggctas nayrsnayrh Odhegaatta aattccagca cactggctag aasnayrttc 180 
snasnanrma dtcsnayrtc namcrstwrd Ocmdhtaanna cahncgaatt aaattccagc 240 
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acactggcta ttdgdanObC cdaOOaOcaO 
tndcmursnan afynnrsatns ksOycmbrrmc 
dcmntmswrd dcvrddcmnt 



rtntrynOanr abwrddcmnt simraryiirBia 300 
rbanbctkmk nigOgOgccrO rsiricrstwrd 36C 

38C 
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CI Alms Nos.: claims 32. 33. 35, 36. 76. 71. 76. 78 and 79 and partially 
claims 1. 16. 28. 44. 61. 72 and 141 



Present claims 1. 18. 28. 44. 61. 72 and 141 relate to an extremely large 
number of possible products. In fact, the claims enconpass so many 
possible conpounds that a lack of clarity (and/or conciseness) within the 
meaning of Article 6 PCT arises to such an extent as to render a 
meaningful search of the clains inpossible. 

Moreover, in view of the large nuntier and also the wording of the claims 
presently on file, which renders it difficult, if not inpossible. to 
determine the matter for which protection is sought, the present 
application falls to comply with the clarity and conciseness requirements 
of Article 6 PCT (see also Rule 6.1(a} PCT) to such an extent that a 
meaningful search is itipossible. 

In addition, the obscure definition of claims 32. 33. 35. 36. 7Q. 71, 76. 
76 and 79. relating to an unidentified SEQ ID. and referring to the 
exanples renders as well the search of these claims Impracticable. 

Consequently, the search has been carried out for those parts of the 
application which do appear to be clear, namely the particular sequences 
SEQ ID NOs.: 1. 2, 3, 4. 5, 6, and 8» variants* and uses thereof 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no International 
search report has been established need not be the subject of an 
International preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is nonmlly not to carry out a 
preliminary examination on matter which has not been searched. This Is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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Tills International Searching Authority found multiple [groups of) 
inventions in this International application, as follows: 

1. Claims: 1-31» 34» 37-69» 72-75, 77, 88-129. 
136-148 and partially 141 



Proteases capable of cleaving the beta secretase cleavage 
site of APP. variants thereof; polynucleotides encoding 
them; vectors and host cells cooprlsing the sane: antibodies 
for the polypeptides and uses of the foregoing in screening 
tests. 



2. Claims: 138-135 and partially 141 

APP isofonn Mherein the last two carboxy terminus amino 
acids are Lysine residues. 
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